Healing traumatized minds


Host countries need to deal with the raised levels of mental-health disorders in refugees if they expect 
them to integrate well, and that could mean benefits for psychological care in rich countries too. 


“Wir schaffen das!” proclaimed Germany's chancellor
Angela Merkel at the peak of the refugee crisis in August
last year. “We will cope.”

Her promise met with widespread acclaim and the German public in
turn proffered a Willkommenskultur, or ‘culture of welcome’. But within
months, as hundreds of thousands more refugees entered the country,
the phrase began to hang heavy around Merkel’s neck, and could yet
sink her. And the Willkommenskultur must confront a growing shockwave
of xenophobia, pushed ahead of the waves of displaced people.

Merkel’s phrase had of course been referring to the German population,
who have to cope with finding accommodation, education
and jobs for the refugees, as well as providing health services. This
costly range of social adjustments will be more challenging even than
the 1990 reunification. (Germany is the most popular destination in
Europe for refugees.)

The refugees have to cope too. Yet in the highly charged public
debate about the refugee crisis, the mental state of these vulnerable 
people rarely features — aside from vague and nakedly political 
warnings that some could be dangerous. 

Refugees must cope with having been driven from their homes by 
violence and fear, and arriving in a foreign country with nothing. They 
need clear heads to make good decisions about their immediate and 
longer-term futures. They need flexibility of mind to adapt to their 
new, often disappointing, environments. They need to learn new things
quickly, not least the language of their host countries, to meet the expectations
that they will integrate quickly. But the existential stresses faced
by the refugees at home and on their dangerous journeys have taken a
disastrous toll on the minds of many of them. 


THERAPY DEVELOPMENT 
In a News Feature on page 158 this week (of which Monday was World
Mental Health Day), Nature examines some of the issues. The headline
figure is deeply concerning. Psychologists in Germany estimate
that more than half of those who have recently arrived there could 
be affected by post-traumatic stress disorder, depression, anxiety or 
another mental disorder. This is not a good basis for coping — for 
decision-making, adapting and learning. Such mental-health conditions
reduce cognitive capacities and suck energy and motivation.
Medics and others who have worked with traumatized populations in
far-flung war zones, such as Cambodia, Vietnam, sub-Saharan Africa,
the Balkans and the Middle East, are familiar with this. The World
Health Organization and the United Nations High Commissioner for 
Refugees published new guidelines last year that stress the importance 
of noticing and addressing mental-health issues. Yet host countries tend 
not to realize the extent of the problem, or to argue that mental-health 
problems can be tackled after the refugees are settled. But the right 
moment for support is during the volatile times. That makes sense for 
practical as well as humanitarian reasons — to ensure the smoothest 


passage to integration for those who have no prospect of a return home.

Some German cities are rolling out some modest pilot programmes 
for psychiatric help, and Sweden will do so soon. But, considering
the numbers affected, a very large investment is required across the 
continent. The European Union registered 1.4 million people seek- 
ing asylum in the 18 months up to June this year, and hundreds of 
thousands more may have entered without registering, according to 

German estimates. 


Access to mental-health provision is already difficult for many citizens.
Prioritizing work with new arrivals is a tough sell. So it is important
to consider the payback for the wider society.

Huge numbers of traumatized refugees in the Middle East and Africa are
camped in countries with few psychiatrists, or are caught in areas too
dangerous for aid workers to access. Their fate has accelerated efforts to develop
simple and cheap therapies, some Internet- or app-based, as alternatives 
(or supplements) to conventional contact-heavy therapies. To broaden 
delivery, these can often be administered by trained lay people. 

Clinical psychologists and psychiatrists now want to properly test 
these new therapies, which are based on the most up-to-date under- 
standing of the brain and cognition, among refugees in Europe. Apart 
from the immediate relief they could bring, and the consequent chance 
of faster integration, there are two main reasons to encourage this. 

These efforts will help to refine the therapies for application in all 
refugee centres, wherever and whenever war breaks out. And they 
will also help to break down barriers to modern approaches to clinical 
psychology in Europe, where the discipline has become conservative 
and complacent. Too many psychologists are reluctant to consider 
how mobile-device and Internet-based approaches could supplement 
standard therapies, and are too resistant to the concept that anyone 
who is not a qualified psychologist could help. The experience with 
refugees might also inspire improvements in local access to mental- 
health provision by generating, through necessity, a system that works 
faster and has fewer barriers. 

Much could also be learnt from an ‘employment buddy system’ in 
Germany called ‘Wir zusammen’ or ‘We together’. The strongest signal
of successful integration is entry into the work force, but this can be
fraught with diverse and unpredictable problems. Wir zusammen is a
movement of chief executives who create jobs or training positions for
refugees that come with on-staff mentors: volunteers who oversee
and champion their charges and who can accompany them to official
appointments. This practical support could reduce the gnawing
stresses that undermine mental health. 

‘We together’ and ‘We will cope’ are feel-good phrases, but they
should not be dismissed as platitudes. When it comes to mental health,
they are fundamental.
Genetic reckoning


Researchers need to reassess many accepted 
links between mutations and disease. 


One of the major findings of the Exome Aggregation
Consortium (ExAC), the largest-ever catalogue of genetic
variation in the protein-coding regions of the human genome,
is that many genetic mutations have been misclassified as harmful 
(M. Lek et al. Nature 536, 285-291; 2016). Authors of that study 
estimate that each person has lurking in their genome an average of 
54 mutations that are currently considered pathogenic — but that 
about 41 of these occur so frequently in the human population that 
they aren't in fact likely to cause severe disease. That finding is having
major consequences for some people with such variants, lifting the 
equivalent of genetic death sentences (see page 154). 

That raises two challenges for researchers: how to sort out which
mutations currently considered pathogenic are actually benign, and 
how to apply more rigorous tests to future research that aims to find 
the genetic causes of disease. 

Working out which mutations are actually linked to illness will be 
a long and arduous task. For instance, geneticist and physician Leslie
Biesecker of the US National Human Genome Research Institute in
Bethesda, Maryland, found that a patient referred to him for diagnosis
harboured a genetic variant that had been linked to kidney failure. Yet
it turned out that the variant was too common in ExAC to realistically
be causing a rare kidney ailment. So Biesecker checked genome
sequences from 950 people whom he had previously sequenced in a
study called ClinSeq (K. L. Lewis et al. PLoS ONE 10, e0132690; 2015).
Five of them had the same variant, with no history of kidney disease, 
indicating that the variant probably does not actually cause this illness. 
To probe further, Biesecker is now recontacting the five ClinSeq par- 
ticipants with the variant to ask them to take part in follow-up tests to 
check whether they have normal kidney function, including collecting 
multiple urine samples over a 24-hour period. 


To reassess the links between diseases and mutations, researchers 
must have access to a group of people whose detailed genetic and 
clinical information is known, and that’s rare. It also takes time and
some cost; multiply that by the huge numbers of ‘pathogenic’ variants
that have been called into question, and researchers are looking
at a major undertaking. It’s a crucial one, because geneticists are
being asked every day to make judgements about the harm that could 

be caused by mutations found in patients’ genomes. Biesecker hopes that
planned or existing projects to link people's genomes to their detailed
health records (such as the US president's Precision Medicine Initiative,
which aims to sequence at least 1 million Americans, and the UK
100,000 Genomes Project) will help.

The rethink on pathogenicity shows that
researchers who hunt for genetic mutations likely to cause disease need 
to be cautious. Many, it seems, have not required enough evidence 
before asserting that a particular variant is harmful. 

Early efforts to discover the genetic underpinnings of disease started 
with families in which a particular condition recurred, generation after 
generation. By studying their extensive pedigrees, researchers could 
see strong evidence that certain mutations caused the disease. But in 
recent years, researchers have switched tactics: for instance, searching 
for evidence of pathogenicity by scanning for mutations that are more 
common in people with disease than in those without. It is becom- 
ing clear that many human genetic variations are relatively rare, and 
when researchers do not examine large enough groups of people with 
and without disease when scanning for pathogenic mutations, they are 
likely to mistakenly conclude that particular variants of interest turn 
up only in people with disease. The truth may be that they just haven't
looked hard enough for these variants elsewhere. 

These conclusions have consequences for real people, and so 
researchers must go about this work differently. When they suspect 
that a variant is linked to disease, they should check to see how common
it is in databases such as ExAC. Even better, they should hunt
for evidence that the mutation has a functional role in disease before
declaring that it is pathogenic. Let the reckoning begin.


Buzzword off 


‘Nexus’ is enjoying new-found popularity. 
But what does it actually mean? 


At Nature we like to think we have always been ahead of the
curve, so it’s pleasing to see that this journal was using the
word ‘nexus’ some 140 years ago. We mentioned in a book
review the “organic nexus between the motor and tactile centres” in
the brain in November 1876, for example.

In the twenty-first century, the term nexus stands for more than its
dictionary definition of a connection or focus point. It’s a buzzword,
especially when tagged on to the end of a string of associated nouns.

For example, an article in Environmental Science and Policy
this month draws attention to the popularization of the phrase
‘water-energy-food nexus’ in debates over the use of natural resources
(R. Cairns and A. Krzywoszynska Environ. Sci. Policy 64, 164-170;
2016).

Language matters and, although Aaron Ellison argues in this week's
World View on page 141 that the term ‘natural resources’ itself should
be retired, we’ll skip that to examine Cairns and Krzywoszynska’s main
point: buzzwords are Orwellian and obfuscate even as they pretend to
enlighten. Discussions of the nexus between connected crises, in other
words, can generate more linguistic heat than policy light.

The authors argue that “understandings and usage of the term nexus
are plural, fragmented and ambiguous”. This is not always viewed as a
negative by those who use the word, of course. When there is little of 
substance to say, it often helps to use language that acts as a mirror so 
finely polished that every reader can see their own agenda and interests 
reflected. There are echoes here of the way the term Anthropocene has 
been adopted and borrowed by a range of scientific disciplines, each 
of which wants a taste of the action. 

“The term nexus appears to have something of a paradoxical
quality,” say Cairns and Krzywoszynska, “being simultaneously unarguably
true at a simple descriptive level, and yet confusingly unintelligible
or meaningless to actors unfamiliar with the discourse.”

The motives behind the eagerness to jump on the nexus bandwagon
are not always sinister. Honest brokers use the term to try to encapsu- 
late that unspoken and undefined territory where the implications of 
one action bleed into another; when the equal and opposite reaction 
also has consequences. 

But the risk is that containing this territory, however loosely, constrains
it instead, and that the nexus becomes the focus of the analysis,
rather than a natural consequence of studying the supporting
problems.

Perhaps, like the most distant stars, the nexus is best viewed only 
with peripheral vision: we can see it’s there, but we shouldn't focus
our gaze directly on it lest its true nature slip from view. And, at the
very least, it shows that we should choose our buzzwords with care.
It’s time to get real
about conservation


To protect endangered species from extinction, the ecological community
must become more politically involved, argues Aaron M. Ellison.


How can scientists protect biodiversity? In the wake of August's
Great Elephant Census, which revealed a precipitous decline
in numbers throughout Africa, there were the usual calls from
researchers for more and better data. Only if we know where and how
many of each species there are, this argument goes, can we hope to
conserve them. This is nonsense.

Better data will not save elephants, rhinos or any other species. An
enormous number of individuals, academic institutions, local, state
and national governments, and multinational and non-governmental
organizations have been collecting, assimilating and organizing such
data for decades, essentially fiddling while our biological heritage burns.

Of course, biodiversity data can be important for conservation, to
suggest priorities and to draw attention to threatened and endangered
species. But biodiversity data rarely drive conservation
decision-making. Rather, in the vast majority of cases, they are
used to bolster decisions made for other reasons. The decisions last
week by the Convention on International Trade
in Endangered Species of Wild Fauna and Flora
(CITES) to tighten trade in endangered species
of sharks, parrots and pangolins show this. Fascination,
charisma and plush toys captured the
imagination of the delegates, and journalism,
political pressure and social-media campaigns
pushed the decisions.

This week's Global Scientific Meeting in
South Africa's Kruger National Park of the International
Long-term Ecological Research network
(ILTER) demonstrates the problem. With its long
and enviable track record in integrating social
dynamics into the study of ecological systems and
engaging with policy- and decision-makers to develop conservation
policies, ILTER is meeting in the lengthening shadows of faceless
elephants, dehorned rhinos, vanishing gorillas and the many other
threatened and endangered endemic species of a continent besieged
by Al-Qaeda, Al-Shabaab, ISIS and Boko Haram. Many nations are
embroiled in civil conflict and some are ruled by corrupt kleptocrats
more interested in using the government purse to renovate their estates
than to lift their populations out of poverty, much less to conserve their
biodiversity or even adequately staff their ‘paper parks’.

In this light, do we really need more scientific sessions on nitrogen
cycling or drivers of biodiversity across scales? Sure, if the goal is simply
to publish more abstruse papers and more data sets that will be read
only by our friends and colleagues. But we should not delude ourselves
that these sessions, or the data and scientific syntheses they yield, will
help decision-makers find the energy and backbone to stop elephant
poaching in Africa, clearcutting and burning in Indonesia, fracking and
fouling of water supplies in North America or eating anything that walks
with its back to the sky in China.


Rather, if biodiversity really matters for the planet, and is essential
for humanity's well-being, we need to get real about what it will take to 
conserve it for future generations. I suggest three crucial actions that 
scientists can take, beginning right now. 

First, stop referring to anything that isn't human as a ‘natural
resource’. Language matters, and this language suggests that the existence
of other species is predicated on the benefits they provide for us.
Natural historians and systematists have long asserted that we need to 
‘put names to faces’ before we can care about non-human species. But 
even though we have already described and named millions of species, 
the precipitous decline of worldwide biodiversity makes it abundantly 
clear that naming species isn't enough. 

Second, acknowledge that better data rarely lead to ‘better’ decisions
(or at least to those decisions we think we would
make if we were in charge). No amount of data 
can overcome visceral negative responses to bats, 
spiders or snakes, or positive ones to pandas, pan- 
golins or baby seals. Decisions about which species 
to save, and which to triage to extinction, are
based on raw emotion, the views of many different 
stakeholders and myriad political calculations. As 
the CITES process has demonstrated, data can be 
marshalled to support conservation decisions with 
broad-based support from a range of parties. But 
such consensuses are increasingly hard to come by, 
the resulting CITES decisions still do not provide 
airtight protection, and as conflicts rage around 
the world and rapid economic growth continues 
to be prioritized over conservation in both devel- 
oping and developed countries, biodiversity will 
continue to decline.

Third, more scientists must get actively involved in the political 
process. Calling, e-mailing and writing to political leaders is a small 
but necessary first step. Showing up for seemingly endless political 
meetings is a larger but necessary follow-up. If we're not in the room,
our voices won't be heard. Volunteering for local, regional, national
or international groups directly involved in conservation decisions
is a bigger commitment. But if not us, who? And running for elected
office would logically follow. If not now, when? 

Scientists studying ozone depletion and climate change have shown 
that getting involved directly in the decision-making process can give 
scientists a place at the global table and a voice to help effect political 
change. Scientists who both study biodiversity and want to see other 
species persist and thrive must follow their example. 


Aaron M. Ellison is the senior research fellow in ecology at 
Harvard University and a conservation commissioner in Royalston, 
Massachusetts. 

e-mail: aellison@fas.harvard.edu 
RESEARCH HIGHLIGHTS


Selections from the 
scientific literature 


Protozoan 
protects the gut 


Many single-celled 
microorganisms are harmful, 
but others regulate the immune 
responses of their animal hosts 
to guard against infections. 
Some such organisms, called 
protozoa, live in the intestine, 
but have not been as well 
studied as their disease-causing 
counterparts. Miriam Merad at
the Icahn School of Medicine at
Mount Sinai in New York City
and her colleagues identified
a previously unknown
protozoan, Tritrichomonas 
musculis, in the intestines of 
some laboratory mice. When 
this microbe colonized the 
guts of other mice, the animals 
exhibited an inflammatory 
response that protected 
against pathogenic Salmonella 
bacteria. However, the 
animals also showed increased 
susceptibility to inflammatory 
intestinal disease and colon 
tumours. 
Cell 167, 444-456 (2016) 


QUANTUM COMMUNICATION 


Quantum secret 
kept for a day 


A bit of information can be
kept secure for 24 hours before
being revealed, more than
5 million times longer than the
previous record.

Quantum cryptography
guards against eavesdroppers,
but in secure voting and
sealed-bid auctions, a message
must remain unread and
protected for a certain period
of time. Routing the message
through a pair of trusted
‘friends’ that are between the
sender and receiver can delay
and secure the message, but
the friends would need to be
located extremely far away
from the sender and receiver
to achieve a delay of more than
a few milliseconds.
Anthony Martin and his
colleagues at the University
of Geneva in Switzerland
developed a protocol in
which this kind of exchange
happened 5 billion times,
with encryption occurring
at each round that built
on that created previously.
This allowed the authors
to separate the sender and
receiver computers from their
‘friends’ by just 7 kilometres
while securing the bit for
24 hours.
Phys. Rev. Lett. 117, 140506
(2016)


CLIMATE


Megadroughts loom large


Climate warming looks set to plunge the
American Southwest into decades-long drought
by the end of the century.

Such ‘megadroughts’ have hit the region
(Lake Powell on the Colorado River, pictured)
during the past millennium. To calculate
how changes in temperature, rainfall and
soil moisture will affect the likelihood of
such events, Toby Ault of Cornell University
in Ithaca, New York, and his colleagues ran
simulations using climate models and two
greenhouse-gas emission scenarios.
If emissions continue to rise unabated,
the projected increase in regional mean
temperature alone will boost the risk of a
megadrought to 70-99% by 2100, depending
on whether precipitation increases moderately,
stays the same or decreases. If warming remains
below 2°C compared to temperatures seen in
the second half of the twentieth century, that
risk falls to less than 66%.
Sci. Adv. 2, e1600873 (2016)


Human-like ape
expectations


Chimpanzees and other great
apes seem to understand the
beliefs of others, suggesting
that this ability is not unique to
humans.

Researchers have long
debated whether humans are
the only primates to have a
‘theory of mind’, the ability
to attribute mental states such
as desires and beliefs to others.
To test this, Christopher
Krupenye at Duke University
in Durham, North Carolina,
Fumihiro Kano at Kyoto
University in Japan and their
colleagues monitored the gaze
of chimpanzees, bonobos and
orang-utans as they watched
short videos. Two videos
showed a person watching an
object being hidden, and then
searching for it. A third video
tested the apes’ understanding
of false beliefs by showing the
object being moved while the
person was not watching. As
the person then prepared to
search for the object, most
apes looked in anticipation to
the location where the person
falsely believed the object was
hidden.

The authors suggest an
implicit theory of mind
predates human evolution.
Science 354, 110-114 (2016)


Warmer forests 
store less carbon 


Climate change might reduce 
the amount of carbon that 
forests can store, in part 
because photosynthesis 
decreases at high temperatures. 

Emily Meineke and her 
colleagues at North Carolina 
State University in Raleigh 
studied willow oaks (Quercus 
phellos) in the local area, 
where, as in other large
cities, pavement and other
hard surfaces absorb and 
slowly radiate the Sun's heat. 
This increases the temperature 
in some urban regions to 
a level comparable to that 
predicted for the next century 
as climate warming continues. 
The team measured trees’ 
photosynthetic rate, as well as 
factors such as water stress and 
pest prevalence, in hotter and 
cooler areas of Raleigh. 

Hotter areas saw less tree 
growth. The team calculates 
that the ‘urban heat island’
effect reduced carbon 
sequestration in these trees 
by 12%. The reduction in 
growth was driven mainly by 
the effect of water deficits on 
photosynthesis, not increased 
herbivore activity. 

Proc. R. Soc. B 283, 20161574
(2016) 


Diatoms sink in 
fits and starts 


Single-celled marine 
organisms called diatoms 
can rapidly alter the speed at 
which they move through the 
water column, despite lacking 
structures for motility. 
Diatoms are photosynthetic 
and are a major contributor
to ocean productivity. Brad 
Gemmell and his colleagues at 
the University of Texas Marine 
Science Institute in Port 


Aransas filmed three species 
of diatom (Coscinodiscus 
radiatus pictured) as they sank 
slowly in laboratory tanks. All 
exhibited stop-start movement: 
previously unobserved bursts 
of rapid sinking followed by 
periods of near-zero sinking. 
The authors visualized the flow
of water around individual 
diatoms, and suggest that the 
organisms alter their buoyancy 
by exchanging ions with the 
seawater. 

Rapid changes in sinking 
speed could explain how these 
diatoms compete for nutrients 
with cells that can actively 
swim. 

Proc. R. Soc. B 283, 20161126 
(2016) 


Strange fading 
star probed 


A star seems to have been
dimming for years, possibly
because of a cloud of material
obscuring it from view. 

Benjamin Montet at 
the California Institute of 
Technology and Joshua Simon
at the Carnegie Observatories,
both in Pasadena, used
instruments on NASA's Kepler
spacecraft to study a star in the
constellation Cygnus called
KIC 8462852, which is brighter
and larger than the Sun. Four
years of observations revealed 
that the star dimmed slowly at 
first, by around 0.9% in total, 
then faded more rapidly by 
2% in only six months. A few 
other stars nearby also became 
dimmer, but not to the same 
extent. 

The authors speculate that
the star's dimming could be
explained by the collision or
break-up of a planet or comets
in the star's system, creating a
cloud of spreading debris.
Astrophys. J. Lett. in the press;
preprint at https://arxiv.org/abs/1608.01316
(2016)


Gut bacteria help 
cancer drug 


Certain gut microbes work 
with a common cancer drug by 
boosting anti-tumour immune 
responses, making the therapy 
more effective in mice. 
Laurence Zitvogel of the 
Gustave Roussy Cancer 
Campus in Villejuif, France, 
and his colleagues studied the 
effect of two species of bacteria 
on the action of the drug
cyclophosphamide. When 
they gave antibiotic-treated 
mice the microbe Enterococcus 
hirae, they found that it made 
immune cells called T cells 
more active against specific 
tumour markers and caused 
intestinal immune cells to 
proliferate. Another bacterium,
Barnesiella intestinihominis, 
drove immune cells to infiltrate 
tumours. In mice that lack 
a protein that restricts these 
species’ growth, the cancer 
drug was nearly twice as
effective at reducing tumour
size as in normal animals.
The results suggest that 
gut bacteria could be used to 
optimize cancer therapies, the 
authors say. 
Immunity http://doi.org/brmm. 
(2016) 


MATERIALS 


Supercapacitor 
made from MOF 


Researchers have built a 
high-capacity energy-storage 
device using a metal-organic 
framework (MOF), a porous
material with many desirable 
properties. 

MOFs are networks of
metal ions linked together by 
organic molecules, and their 
large surface area means they 
hold promise as energy-storing 
supercapacitors. Mircea Dinca 
of the Massachusetts Institute 
of Technology in Cambridge 
and his colleagues created such
a device with electrodes made
only from a nickel-organic
framework (Ni₃(HITP)₂) that
has high electrical conductivity.
The supercapacitor stored
more energy per area than most
other carbon-based devices,
and retained more than 90%
capacity after 10,000 cycles,
on a par with commercial
devices.

Such a supercapacitor
could have an important role
in future energy grids, the
authors say.
Nature Mater. http://dx.doi.org/10.1038/nmat4766
(2016)


Ocean on another 
of Saturn’s moons 


Like its neighbours Titan and 
Enceladus, Saturn's moon 
Dione may harbour an ocean 
beneath its icy surface. 

Mikael Beuthe and his 
colleagues at the Royal 
Observatory of Belgium in 
Brussels studied data collected 
from Enceladus and Dione 
by NASA’s Cassini spacecraft.
They looked for small changes 
in the moons’ gravity and 
shape that can reveal layers of 
buried liquid. Data modelling 
suggested that Dione has a 
65-kilometre-deep global 
ocean hidden beneath some 
100 kilometres of ice. 

Those waters are a possible 
habitat for extraterrestrial 
microbes, should they exist. 
Geophys. Res. Lett. http://doi.org/brg7
(2016)
LIGO in India


A planned Indian outpost
of the Laser Interferometer 
Gravitational-Wave 
Observatory (LIGO)
will probably be built in
Marathwada in the west of
the country, after the state
of Maharashtra announced
on 4 October that it had
approved a 40-hectare site for 
the laboratory. LIGO-India 
will join a global network of 
gravitational-wave sensors; 
these include the two US 
sites, in Washington state 

and Louisiana, that this year 
reported the first detection of 
the waves. 


Ukraine joins CERN 


Ukraine has become an
associate member of CERN,
Europe’s particle-physics lab
near Geneva, Switzerland. The
move allows Ukraine to hold
and attend meetings of the
CERN Council, but without
voting rights. Yurii Klymenko,
Ukraine's ambassador to the
United Nations in Geneva,
called the move, announced
on 5 October, “an extremely
important step on the way
[to] Ukraine’s European
integration”. Ukraine becomes
CERN’s fifth associate member,
alongside Serbia, Cyprus,
Turkey and Pakistan.


Foreign experts
Academics were enraged
last week by the assertion
that the UK government had
barred foreign academics
from advising on Brexit
projects. Reports emerged on
7 October that the Foreign
and Commonwealth Office
(FCO) had told the London
School of Economics and
Political Science (LSE) that it
would not take advice from
academics who did not hold
UK passports. Whether this
communication did actually
happen is unclear. The LSE
maintains that it did, but the
FCO, without commenting
directly on the LSE’s claim,
has said that it will continue to
take advice from “the best and
brightest minds, regardless of
nationality”.


Haiti faces cholera in hurricane’s wake


Hurricane Matthew has swept a path of
destruction across the Caribbean, killing at least
1,000 people when it hit Haiti on 4 October.
Fears of a cholera outbreak are rising there in
the wake of the storm, with at least 13 deaths
from the disease reported so far. Matthew was
the first category-5 hurricane in the Atlantic
since Hurricane Felix in 2007. Its high winds
and heavy rains continued north but skirted
the coast of Florida, where damage was less
than originally feared. Only minor damage was
reported by NASA's Kennedy Space Center,
where the next-generation GOES-R weather
satellite is awaiting launch to improve forecasts
of, among other things, Atlantic hurricanes. The
US death toll was reported to be at least 20.


Detained physicist
On 7 October, France's
supreme administrative court,
the State Council, refused an
appeal by particle physicist
Adlane Hicheur to have his
house arrest lifted. Hicheur has
previously been jailed in France
for terrorism-related offences
(charges vigorously disputed
by his colleagues) and had
restarted his research life in
Brazil following his release in
2012. In July this year, he was
deported to France and placed
under house arrest, under
state-of-emergency powers
introduced there following
terror attacks. His ordeal has
been widely condemned by
his physicist colleagues. In a
further twist last weekend,
Hicheur, who has French and
Algerian nationalities, sought
to extricate himself from the
situation by requesting that his
French nationality be revoked
and that he be immediately
expelled from France.


Kennewick man

The US House of
Representatives has passed
legislation to return
controversial human remains
to Native Americans in
Washington state. The fate of
the 8,500-year-old ‘Kennewick
Man’, a near-complete human
skeleton, has been debated
since its discovery in 1996.
The case pitted scientists who
wished to study the remains
against Native American tribes
that saw them as belonging
to an ancestor. The bill must
be reconciled with similar
legislation passed last month by
the Senate, and it will require
the president’s signature.


Aviation emissions 
After 3 years of debate,
negotiators from 191 countries
struck a deal to reduce carbon
emissions from international
aviation. The agreement, made
on 6 October at a meeting
of the International Civil
Aviation Organization in
Montreal, Canada, establishes
standards for energy efficiency
and an emissions-offsetting
programme that will see the
aviation industry invest in
projects to reduce emissions in
other sectors. The programme
could cover an estimated 75%
of emissions growth from
2021-35, resulting in 2.5 billion
tonnes of carbon offsets,
equivalent to more than 6 years
of UK carbon emissions.
Aviation accounts for 2% of
global carbon emissions.


Climate amendment 
United Nations delegates 
meeting in Kigali, Rwanda, 
are finalizing an amendment 
to the Montreal Protocol,
whereby the treaty will target 
greenhouse gases as well as 
substances that destroy ozone.
The decision to rapidly phase
out hydrofluorocarbons — 
refrigerants that are powerful 
greenhouse gases — was 
taken last year, and described 
as “welcome — and long 
overdue” by Nature at the time
(see Nature 527, 133; 2015). 
Delegates are likely to take 
until the meeting's final day on 
14 October to hammer out the 
details of the amendment. 


BUSINESS 


Theranos cuts
Beleaguered blood-testing
company Theranos of Palo
Alto, California, announced in
an open letter on 5 October that
it is closing its clinical labs and
laying off some 340 employees,
about 40% of its staff. The
company had faced huge
scepticism over its claims that
it could perform hundreds
of diagnostic tests using a
single drop of blood. In July,
US regulators banned its chief
executive, Elizabeth Holmes
(pictured), from running a
lab for two years. Holmes said
that the company will continue
to develop its miniaturized
blood-testing device, miniLab.
Further troubles emerged on
10 October, when a shareholder
company announced that it
would be suing Theranos to
recoup its investment of nearly
US$100 million, which it says
was secured on the basis of lies.


TREND WATCH


Investors fled on 5 October
when a drug company focused
on RNA-interference (RNAi)
therapies announced that it would
abandon one of its lead drug
candidates amid safety concerns.
The drug, revusiran, was in phase
III clinical trials to treat a form
of amyloidosis. The news sent
stock in Alnylam Pharmaceuticals
of Cambridge, Massachusetts,
plummeting by roughly 50%.
Companies have struggled to
translate RNAi into therapies, and
investors have been lukewarm on
biotech stocks this year.


RESEARCH
Telescope time bias 


Female scientists are 
allocated less telescope time 
at the European Southern 
Observatory than are their 
male colleagues, according 
to an analysis posted on
arXiv on 5 October (F. Patat
Preprint at https://arxiv.org/abs/1610.00920;
2016). The study of more
than 13,000 proposals and 
3,000 principal investigators 
found that 16% of proposals 
submitted by women were 
successful, whereas men had 
a success rate of 22%. Much
of the disparity seems to be 
because men submitting 
proposals have more senior 
positions on average than 
women, the study says, and
astronomers at higher career 
levels received greater ratings 
for their proposals during the 
review process. 


AWARDS
Chemistry Nobel


Jean-Pierre Sauvage, Fraser
Stoddart and Bernard Feringa
shared the 2016 Nobel Prize
in Chemistry, announced
on 5 October, for their work
on creating tiny molecular
machines. The three have made
molecular knots, shuttles,
rotors, chains, pumps, axles,
switches, memory devices and
even a nanocar, all at the
molecular scale. The nanoscale
machines are yet to find
applications, but researchers
hope that their uses could
range from delivering drugs to
computer memory (see page
152). Separately, Yoshinori
Ohsumi, the cell biologist who
won the 2016 medicine Nobel,
announced that he would use
the 8 million Swedish kronor
(US$940,000) awarded to
establish a system to provide
support for young researchers
over decades.


RNAi-DRUG FAILURE SENDS STOCK TUMBLING


Alnylam Pharmaceuticals’ share price crashed by about
50% on 5 October, after the company halted a phase III trial
of a drug that uses RNA interference.


SEVEN DAYS | THIS WEEK


16-21 OCTOBER
The American
Astronomical Society's
planetary-sciences
division and the
European Planetary
Science Congress meet
in Pasadena, California.
aas.org/meetings/dps48


1-10 NOVEMBER
The first Berlin
Science Week brings
together academics and
institutions from all
over the world to discuss
science and society.
www.berlinscienceweek.com




Research jackpot 
A US$75-million grant for
research into coronary heart
disease has been awarded to a
team led by Calum MacRae, 
chief of cardiovascular 
medicine at Brigham and 
Women's Hospital in Boston, 
Massachusetts. The money 

is from the One Brave Idea 
fund, set up by the American 
Heart Association, Verily Life 
Sciences (owned by Google's 
parent company, Alphabet) 
and drug firm AstraZeneca 
to fund research on the most 
common type of heart disease 
in the United States. The grant, 
an unusually large sum for a 
single team, will be delivered 
in $15-million chunks over 
five years. 


CORRECTION 

The item ‘Nuclear go-ahead’ 
(Nature 537, 455; 2016) 
should have said that Hinkley 
Point C is expected to supply 
7% of UK electricity (not
energy) demand.


UK Prime Minister Theresa May gives the closing speech at the Conservative Party conference, where she took a hard line on immigration.


Scientists spooked by UK 
anti-immigration stance 


Plans to restrict freedom of movement intensify fears over June’s Brexit vote. 


BY DANIEL CRESSEY 


UK scientists say they're dismayed by
their new government's toughened
stance on curbing immigration,
including ideas to restrict the flow of foreign
students and workers.

The government outlined its plans last week
at the annual Conservative Party conference, 
which was the first since Theresa May became 
prime minister in the wake of June’s vote for 
Brexit — the decision that the United King- 
dom should leave the European Union. At the 


conference in Birmingham, politicians made it 
clear that they want to eliminate the free move- 
ment of EU citizens into the United Kingdom 
once the country splits from the EU, an event 
now expected to take place in 2019. “We are 
not leaving the European Union only to give up 
control of immigration again,” said May.
Since the referendum, in which concerns 
over immigration were believed to have played 
a big part in swaying voters, scientists have 
worried about how Brexit would affect the free 
movement of people. Although the govern- 
ment has not fleshed out its latest proposals, 
they are the strongest indication yet that scientists
will not be allowed to move freely between
the United Kingdom and the EU after Brexit — 
which in turn means that UK researchers may 
be excluded from EU funding programmes.
“There has been a change in tone. I was sur- 
prised by how strong some of the comments 
were,” says Azeem Majeed, who heads the 
department of primary care and public health 
at Imperial College London. The perception 
that non-UK citizens are not welcome — which 
grew as a result of June's Brexit referendum — 
has only increased since the conference, 
he says. That is particularly the case in
health fields, because the government has
pledged to cut the number of jobs given
to foreign doctors, in favour of UK citizens.
UK universities get about 16% of their
research funding and 15% of their staff from
the EU, and scientists have been vocal about
the need to maintain freedom of movement.
It may even be a prerequisite for UK access
to EU research funding. When Switzerland
restricted freedom of movement in 2014, its
researchers lost access to the major Horizon 2020
research-funding programme,
leading to protracted negotiations that are
still ongoing. “The hard line on freedom
of movement is almost certain to restrict
us from EU funds,” says Stephen Curry, a
structural biologist at Imperial College London.
Other comments on immigration and
restricting foreign students are also going
down poorly in academia, he says. “It’s
reinforcing the rather sour atmosphere.”


WORKER RESTRICTIONS 
UK home secretary Amber Rudd said in 
her conference speech that the govern- 
ment would consider making it harder to 
recruit from overseas, forcing companies 
to publicly disclose the proportion of for- 
eign staff working for them (an idea that 
other politicians later disavowed) and cutting
down on universities’ ability to recruit
foreign students to “lower quality courses”.

“We had a very decisive message from the 
Conservative conference that the priority 
is simply reducing the number of people 
who come here, and if that damages the 
economy, so be it,” says Jonathan Portes, an
economist at the UK National Institute of 
Economic and Social Research in London. 

However, the conference was not all bad 
news for science and science policy, says 
Sarah Main, director of the Campaign for 
Science and Engineering in London. She 
says that the comments on immigration are 
concerning. But she adds that, at the Bir- 
mingham conference, “We've seen the gov- 
ernment being much more clearly positive 
about research and innovation in general.”

She cites a speech from Chancellor of
the Exchequer Philip Hammond (which
praised science as a driver of growth and
emphasized the need to get “the brightest
and best to work here in our high-tech
industries”) and positive comments from
science minister Jo Johnson at events away 
from the main auditorium. 

But Portes, chief economist for the UK 
Cabinet Office during the 2008-09 finan- 
cial crisis, says that Hammond's positive 
messages for science don't outweigh the 
negative impacts of May and Rudd's plans. 
“It's nice to know the chancellor is not on
the same page as the PM and the home
secretary, but it seems pretty clear who is
calling the shots,” he says.
POPULATION STUDIES


South Africa plans 
huge health study 


Network would be Africa’s largest demographics project if it 
can sustain long-term funding. 


BY LINDA NORDLING 


South Africa’s government has announced
that it will expand the country’s existing
demographic studies to create the largest
project of its kind in Africa, tracking the
health, income and educational attainment of
around 1% of South Africa’s population.

The Department of Science and Technology
estimates that it will put 264 million rand
(US$19 million) into the demographic project
over the next five years, which will eventually
cover at least half a million people. It has
secured the funding for its first three years; the
rest will need to be allocated in future government
budgets.

If the study can be sustained in the long
term, researchers hope that the data will help
them to track efforts to curb major health
problems such as HIV and tuberculosis, and
to monitor emerging lifestyle-related threats
such as cancer and diabetes. The department
intends the survey to run for decades,
following people from the cradle to the grave
and monitoring intergenerational trends.

Long-term demographic studies have played 
an important part in charting disease patterns. 
One survey that began in 1948 in Framingham, 
Massachusetts, led to the discovery of cardio- 
vascular disease risk factors such as smoking 
and diabetes, and has allowed scientists to 
study intergenerational disease patterns. 

But in Africa, as in many other parts of the 
developing world, such long-term projects 
have been neglected in favour of a focus on 
health emergencies such as HIV or Ebola, says 
Glenda Gray, president of the South African 
Medical Research Council. 

“You never get your head above water to 
plan for the future,” she says. She thinks that 
the new project will change that. 

South Africa has had three demographic 
surveillance projects running since the mid- 
to late 1990s, based in Mpumalanga and 
Limpopo in the northeast, and KwaZulu-Natal
on the east coast. These have been able to track
trends such as a growth in life expectancy as
the country rolled out antiretroviral drugs to
fight the HIV epidemic. But the long-term
sustainability of such studies (which have been
funded by non-governmental donors) is a
perennial concern, says Kobus Herbst, deputy
director of the African Health Research Institute,
based in Durban, which runs the study in
KwaZulu-Natal. So the government’s investment
is particularly welcome, he says.


FROM RURAL TO URBAN 
All the existing surveillance projects are in 
rural areas, providing only a narrow view of 
national population trends, says Gray. “The 
rural sites have been critical for understand- 
ing things like how antiretroviral rollout 
plays out in districts,” she says. But they don't 
catch emerging patterns of disease linked 
with modern city life, driven by factors such 
as pollution, work-related stress and dietary 
changes. Of the four new surveillance nodes
in the planned network, three will be based 
in South Africa's biggest cities: Cape Town, 
Johannesburg and Durban (see ‘Tracking
South African health’).

The existing surveys already cover around 
250,000 people, but each collects different types 
of data, so their measurements cannot be com- 
pared or integrated together. During the first 
three years, the surveys will be linked up, and 
the Limpopo one will be expanded. By the end 
of the three-year period, a total of 300,000 peo- 
ple should be included in the project, says 
Herbst. The target of 500,000 people will be 
reached, hopefully, in five years’ time, he says. 

The government funding will cover the full
health and socio-demographic surveys, and
will fund linkages to national health records 
and the collection of dried blood spots from 
adult participants once a year for HIV test- 
ing, Herbst says. To do more — such as DNA 
sequencing — will require funding from exter- 
nal donors. 

Linda Fried, an epidemiologist who is dean 
of Columbia University’s Mailman School of 
Public Health in New York City, thinks that 
the surveys will not only allow South Africa 
to develop its science base but will also attract 
international investment. 

The programme was launched on 4 October
by South African science minister Naledi
Pandor, at a conference to plan out South
Africa’s first road map for national research
infrastructures. In addition to the demographic
project, the road map launched this
week includes plans for a nuclear-medicine
research facility dedicated to drug development
and clinical research, a solar-research facility
to demonstrate photovoltaic technologies, and
a new hub to coordinate efforts to protect the
country’s natural-history collections.

“We build big scientific infrastructure 
to attract international researchers to our 
country,” Pandor said.


PLANETARY SCIENCE 


NASA rethinks Mars exploration 


Agency considers time-allocation model in an era of shifting international interests. 


BY ALEXANDRA WITZE 


NASA is investigating a new way
of studying Mars. Starting in the
2020s, scientists who participate in
the agency's Mars missions might no longer
design and build their own highly specialized
payloads to explore the red planet. Instead,
planetary scientists could find themselves
operating much as astronomers who use
large telescopes do now: applying for time
to use a spacecraft built with a generic suite
of scientific instruments.

The proposed change is spurred by NASA’s
waning influence at Mars. The agency's
long-running string of spacecraft is winding
to a close, and international and commercial
interests are on the rise. By the middle of the
next decade, European, Chinese, Emirati and
SpaceX missions are as likely to be at Mars as
NASA is (see ‘Red-hot planet’).

Jim Watzin, head of NASA’s Mars exploration
programme in Washington DC, suggested
the new approach to the red planet on 6 October
at a virtual meeting of Mars scientists. “The
era that we all know and love and embrace is
really coming to an end,” he said. “It’s important
to recognize that the future is not going to
be the same as the past.” 

Throughout the 2000s, NASA sent a 
sustained barrage of spacecraft to Mars, unique
in the sheer number of robots directed at one
planetary target. But many have expired, and
the ones still operating are growing old. NASA’s
three functional orbiters — Mars Odyssey, 
Mars Reconnaissance Orbiter, and MAVEN — 
launched in 2001, 2005 and 2013 respectively. 
The Opportunity rover is in its thirteenth year,
and the Curiosity rover is in its fifth. 

NASA has only one more spacecraft 
scheduled in its Mars programme, a rover due 
to launch in 2020 that’s tasked with gathering
samples for an as-yet-unscheduled return to 
Earth. (The InSight geophysics probe, slated 
for a 2018 launch, was not developed under
the auspices of NASA’s Mars programme.)

[Figure: ‘Red-hot planet’]

NASA wants to start planning for an orbit- 
ing mission to launch after 2020. In June, the 
agency asked five companies about what sorts 
of Mars orbiters they might be able to build, 
and how quickly and cheaply that could be 
done. Five international partners have also said
they would like to be involved, Watzin said. 

Many non-NASA missions to Mars are 
already on the books. In 2020, the European
Space Agency and China each plan to launch 
Mars rovers, while the United Arab Emirates 
will send an orbiter. SpaceX of Hawthorne, 
California, hopes to send its first Red Dragon 
landers to Mars in 2018. 

This broadening context prompted Watzin
to propose the new way of operating Mars
missions. “I'm not trying to fix something
that's broken,” he said. “I'm trying to open
the door to a larger level of collaboration and
participation than we have today.”

In the facility-based approach, scientists 
would propose investigations using one or 
more instruments on a future spacecraft. 
NASA would award observing time to spe- 
cific proposals, much as telescope-allocation 
committees parcel out time on their mountain- 
tops. This would be different from the current 
approach, in which individual teams of scientists
propose, build and operate instruments.

Watzin's proposal is a trial balloon, not an
official change to NASA policy. “It’s a little
early yet to figure out how the community 
is going to respond,” says Jeffrey Johnson, a 


planetary scientist at the Johns Hopkins Uni- 
versity Applied Physics Laboratory in Laurel, 
Maryland, and head of the group that organ- 
ized the meeting. 

But some researchers are already pushing 
back. Alfred McEwen, a planetary scientist at 
the University of Arizona in Tucson, noted that 
the Mars Reconnaissance Orbiter’s HiRISE 
camera has taken thousands of images of Mars 
based on public requests. “We've managed to 
do all the things [Watzin] described already 
without a new paradigm,” says McEwen, the 
camera's principal investigator. “We have distributed
operations, we have multiple customers,
we have a foreign contributed instrument.
So my immediate reaction to this idea was not
very positive.”


DRUG DEVELOPMENT


Safety concerns blight 
promising cancer therapy 


As the first T-cell treatments for tumours near US approval, researchers race to engineer 


less-toxic versions. 


BY HEIDI LEDFORD 


A groundbreaking treatment that
uses immune cells called T cells to
battle cancer is barrelling towards
regulators, fuelled by unprecedented clinical
success and investor exuberance.

But progress of the therapy, called
CAR-T, has been marred by its toxicity;
several deaths have been reported in clinical
trials. Even as the first company readies its
application to the US Food and Drug Admin- 
istration (FDA) — expected by the end of the 
year — researchers are hard at work to make 
the supercharged T cells safer. 

Doing so is crucial to expanding the use 
of the therapy to more people, says Anthony 
Walker, a managing partner at Alacrita, a 
consulting firm in London. “Right now it 
is heroic medicine,” he says — a gruelling 
treatment deployed only in people for
whom all else has failed. “Patients are taken
sometimes to within an inch of their lives.”

Most CAR-T procedures begin by harvesting
a patient's white blood cells and sifting
out the T cells. Those T cells are engineered
to recognize cancer cells, and then infused 
into the patient, ready to do battle. The 
approach has shown remarkable success 
against leukaemias and lymphomas: in one 
study, all traces of leukaemia disappeared in
90% of the patients who received the treatment
(S. L. Maude et al. N. Engl. J. Med. 371,
1507-1517; 2014).

Results such as those have fuelled an
investor frenzy. “It set the field on fire,” says
Walker. Swiss pharmaceutical giant Novartis
invested in the technique in 2012. In 2014,
CAR-T firm Kite Pharma of Santa Monica,
California, raised US$128 million when it
went public. A few months later, one of its
competitors, Juno Therapeutics of Seattle,
Washington, yielded $264 million in its
initial public offering.

Now Kite is racing to be the first to bring a 
CAR-T therapy to the market. On 18 October, 
the company will update investors on its 
plans to manufacture and sell the complex 
therapy, which it hopes to launch in 2017. 

But the treatment's toxicity has discouraged
some investors. On 26 September,
Kite released interim clinical-trial results — 
widely seen as successful — in people with 
aggressive non-Hodgkin's lymphoma (see 
go.nature.com/2djdgen). Yet about one-third 
of the patients developed serious neurologi- 
cal side effects, and 18% developed a deadly 
condition called cytokine release syndrome, 
which can cause organ failure. Two of the 
62 patients died as a result of the treatment. 

That toxicity is unlikely to dissuade the
FDA, given the dramatic effects of the treatment,
says analyst Michael Yee of the investment
bank RBC Capital Markets in San
Francisco, California. “It has transformed 
what was essentially a death sentence into a 
potential for long-lasting remission,” he says. 

But the toxicity does leave room for 
improvement. One approach that research- 
ers are studying to boost safety is to improve 
standardization of each patient's dose of 
T cells. CAR-T therapies typically begin with 
a mixture of various kinds of T cell, some 
with very different functions. “Not all T cells 
are created equal,” says Stanley Riddell, an 
immunologist at the Fred Hutchinson Cancer 
Research Center in Seattle. To create a better- 
defined T-cell cocktail, Riddell’s lab first sorts 
out different types of T cell and blends them 
together again in specific proportions. So far, 
he says, trials in 140 patients suggest that the 
approach provides better control of dosage — 
and toxicity (see, for example, C. J. Turtle et al.
J. Clin. Invest. 126, 2123-2138; 2016). 
Other groups have developed a ‘suicide
switch’ to shut off the CAR-T cells in the 
body. If toxicity is spiralling out of control, 
doctors can administer a drug that activates 
the switch — a modified version of a protein 
called caspase-9 — and triggers the CAR-T 
cells to self-destruct. 

That approach has not been popular with
clinical researchers, who often opt to treat
toxic reactions with other drugs rather
than risk shutting down the treatment
altogether, notes Michel Sadelain,
an immunologist at Memorial Sloan
Kettering Cancer Center in New York
City. “What you're doing is destroying your
extremely expensive medication after you've
administered it,” says Walker.

But cancer researcher Malcolm Brenner
of Baylor College of Medicine in Houston,
Texas, says that the switch is not all or nothing:
adding just a little of the activation drug
can dampen toxic effects without killing all
of the engineered T cells.

Michael Brown, a clinician and cancer
researcher at the Royal Adelaide Hospital in
Australia, is using the suicide-switch approach 
in his CAR-T clinical trial against melanoma. 
So far, he says, his patients haven't needed it. 
But having the switch in place helped him 
to feel more secure about the trial, in which 
T cells target a protein that is more abundant 
in melanoma but is also expressed at low levels 
in normal brain tissue. 

Many researchers hope to reproduce 
CAR-T's success against leukaemia in solid 
tumours such as melanoma. And, like Brown,
they are struggling to find proteins on cancer
cells that could serve as targets for T cells
but that are absent from normal tissues. One
way could be to focus on multiple proteins
expressed by cancer cells, says Sadelain.
The therapy will then attack only cells that
express all of those proteins, to provide a 
more precise way to mark tumour cells for 
destruction. 

For now, all eyes are on Kite to see whether it
can get its therapy approved by regulators and 
into hospitals. Even if the company succeeds, 
Walker is betting that the treatment will be the 
first in a long line of CAR-T therapies. “We're 
at such an early stage in this field,” he says. 
“If you wind the clock forward 10-15 years,
I think it will be unrecognizable.”
EDUCATION
Where Nobel 
winners start 


List reveals undergraduate 
schools with most laureates. 


BY TOM CLYNES 


A new study ranks institutions by the
proportion of their undergraduates that
go on to win Nobel prizes.

Two schools dominate: École Normale Supérieure
(ENS) in Paris and the California Institute
of Technology (Caltech) in Pasadena. These
small, elite institutions each admit fewer than 
250 undergraduates per year, yet their per capita 
production of Nobelists outstrips some larger 
world-class universities by factors of hundreds. 

“This is a way to identify colleges that have 
a history of producing major impact,” says
Jonathan Wai, a psychologist at Duke University
in Durham, North Carolina, and a co-author of
the unpublished study. 

Wai and Stephen Hsu, a physicist at Michi- 
gan State University in East Lansing, examined 
the 81 institutions worldwide that each had at 
least three alumni who received a Nobel award 
in any of the six categories between 1901 and 
2015. For a meaningful comparison, the team
divided the number of Nobel laureates at a 
university by its estimated number of under- 
graduate alumni. 

Many of the top Nobel-producing institutions 
are private, with significant financial resources.
Among the more surprising high performers 
were several small US liberal-arts colleges, such 
as Swarthmore College in Pennsylvania (ranked 
at number 4) and Amherst College in Massa- 
chusetts (number 9). 

To gauge trends over time, Wai cut the sample
of 870 laureates into 20-year bands. US univer- 
sities, which now make up almost half of the 
top 50 list, began to dominate after the Second 
World War. Whereas French representation in 
the Nobel ranks has declined over time, top- 
ranked ENS has remained steady in its output. 

Santo Fortunato, a theoretical physicist at 
Indiana University Bloomington who has 
researched trends in Nobel prizewinners, cau- 
tions that the methodology cannot produce a 
highly accurate or predictive ranking. “There is
a high margin of error due to the low numbers
of prominent scholars,” he says.

Wai and Hsu agree that there are statistical 
uncertainties in their rankings, owing to the 
small number of prizes awarded each year. The 
two are confident that the ENS and Caltech lead
the pack, but say that statistical fluctuations 
could change the order of institutions placed 
from third to ninth, Hsu says.
NANOTECHNOLOGY


Molecular architects: Fraser Stoddart, Bernard Feringa and Jean-Pierre Sauvage. 


Chemistry Nobel 
for nanomachines 


Award recognizes three pioneers of molecular motors. 


BY RICHARD VAN NOORDEN & 
DAVIDE CASTELVECCHI 


Three chemists who created tiny
molecular machines have won the
2016 Nobel Prize in Chemistry for
their intricate designs.

Jean-Pierre Sauvage, at the University 
of Strasbourg in France; Fraser Stoddart, 
a Scottish-born chemist at Northwestern 
University in Evanston, Illinois; and Bernard
Feringa, at the University of Groningen in the 
Netherlands, share the award for their work 
in the 1980s and 1990s, when they pioneered 
efforts to miniaturize motors. 

“I'm a bit shocked because it was such 
a great surprise. And I'm so honoured,” 
said Feringa in a telephone interview with 
the Nobel Committee just after the prize- 
winners were announced in Stockholm on 
5 October. 

The three have made molecular knots,
shuttles, rotors, chains, pumps, axles,
switches, memory devices and even a
nanocar, all at the scale of molecules (see
‘Nanomachines’). The nanoscale machines
are yet to find application, but researchers 
hope that their uses could range from deliv- 
ering drugs to computer memory. 

“It’s early days, of course," Feringa told the 
Nobel Committee. “But once you are able to 
control moyement, you have a motor, you 
can think of all kinds of functions.” He sug- 
gested that the machines could be used as 
tiny robots in the body to deliver drugs or 
detect cancerous cells; or as smart materials 
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that could adapt or change depending on 
external signals. 

“I applaud the fact that — for once in
chemistry — Stockholm has recognized a
piece of chemistry that is fundamental in its
making and being,” Stoddart said at a press
conference at Northwestern University, held
later in the day.

Only a handful of laboratories are 
currently actively engaged in making 
nanomachines, says Dean Astumian, who 
studies the theory of molecular motors at the 
University of Maine in Orono. But he thinks 
the field will get a boost from the award. “The
recognition that is afforded by a Nobel prize
is going to attract the best young people,” he
says. Astumian thinks the work will provide
applications within 25 years. “There's no
device that you can buy that’s made out of
molecular machines. But they're coming.”


MOLECULAR ARCHITECTS 
In 1983, Sauvage’s group was the first to create 
molecular interlocking chains and rings — 
called catenanes — which were the first steps 
to creating the connected parts needed for 
molecular motors. By creating interlocking 
rings, Stoddart noted at his press conference, 
Sauvage’s group effectively invented a new 
way to bind molecules together — a mechani- 
cal bond, rather than a chemical one. “New 
bonds are few and far between. They are really 
the blue moons,” Stoddart said. 

Stoddart himself, in 1991, created the first
molecular shuttle: a ring-shaped molecule
threaded onto an ‘axle’, called a rotaxane. The
ring could shunt back and forth between two
sites on the axle, which was capped at each end
by stoppers, and Stoddart and other chemists
worked out how to control that process, using
changes in acidity, light or temperature.

Since then, Stoddart’s team has used similar
rotaxanes to make a molecular ‘lift’, which can
raise itself (by less than a nanometre) above
a surface, and an artificial ‘muscle’, in which
rotaxanes bend a thin sheet. The researchers
have also used millions of rotaxanes to make a
high-density memory device — in which the
shuttles flick from an ‘on’ state to an ‘off’ state.

And in 1999, Feringa was the first to develop
a synthetic molecular motor — a single molecule
with paddle units connected by a
carbon-carbon double bond. The paddles rotated,
and kept on spinning, when the bond was broken
with light. Feringa showed that the motors
could have macroscale effects, such as rotating
a glass rod sitting on top of them. Perhaps
most famously, Feringa has also created a
four-wheel-drive ‘nanocar’ out of the motors.


WIDER IMPACTS 
The Nobel prizewinners’ work — and other
chemists’ nanomachines — have also had
an impact on researchers’ understanding
of nature, Astumian says. In particular, the
artificial systems have helped to demonstrate
that all chemically powered molecular
machines, whether synthetic or biological,
work on the same principle: they selectively
harvest the random jiggles of Brownian
motion, rather than push against them.

NANOMACHINES
Mechanisms the size of molecules are governed
by the rules of chemistry, rather than Newtonian
mechanics. An example is this switch called a
rotaxane. A ring-shaped molecule is threaded
onto a linear molecule and shifts between two
binding sites as the acidity of the surrounding
solution is altered.

Asked by reporters at the Nobel press conference
whether his machines would find a
use, Feringa likened the creators of minuscule
machines to the Wright brothers, who made
their maiden flight in a powered aircraft more
than 100 years ago. “People were saying, why
do we need a flying machine? Now we have
a Boeing 747 and an Airbus. That’s a little bit
how I feel. The opportunities are great.”

During his own press conference, Stoddart
also took political swipes, both at recent UK
anti-immigration rhetoric and at US Presidential
candidate Donald Trump. He said
that his old country, the United Kingdom,
was “in a real mess because it thinks it can
raise borders to people coming in”. And referring
to Trump’s comment in his first debate
with Hillary Clinton that not paying federal
taxes would be “smart”, Stoddart said that
one-third of his Nobel earnings would go to
taxes, because, he said, “I am not smart”.


CORRECTIONS

The News story ‘Ukraine embraces solar
and wind power’ (Nature 537, 598; 2016)
gave the wrong year for the annexation of
the Crimean peninsula. It happened in 2014.
And the News story ‘Medical award for cell
recycling’ (Nature 538, 18-19; 2016) gave
the wrong affiliation for Hitoshi Nakatogawa
— he is at the Tokyo Institute of Technology.


SEEING DEADLY MUTATIONS IN A NEW LIGHT

How one of the largest genome
resources in the world has
quietly been changing scientists’
understanding of human genetics.


Lurking in the genes of the average person
are about 54 mutations that look as if they
should sicken or even kill their bearer.
But they don't. Sonia Vallabh hoped that
D178N was one such mutation.
In 2010, Vallabh had watched her
mother die from a mysterious illness called fatal
familial insomnia, in which misfolded prion
proteins cluster together and destroy the brain.
The following year, Sonia was tested and found
that she had a copy of the prion-protein gene,
PRNP, with the same genetic glitch — D178N
— that had probably caused her mother’s illness.
It was a veritable death sentence: the average
age of onset is 50, and the disease progresses
quickly. But it was not a sentence that Vallabh,
then 26, was going to accept without a fight. So
she and her husband, Eric Minikel, quit their
respective careers in law and transportation
consulting to become graduate students in
biology. They aimed to learn everything they
could about fatal familial insomnia and what,
if anything, might be done to stop it. One of the
most important tasks was to determine whether
or not the D178N mutation definitively caused
the disease.

Few would have thought to ask such a ques- 
tion in years past, but medical genetics has been 
going through a bit of soul-searching. The fast 
pace of genomic research since the start of the 
twenty-first century has packed the literature 
with thousands of gene mutations associated 
with disease and disability. Many such associations
are solid, but scores of mutations once suggested
to be dangerous or even lethal are turning
out to be innocuous. These sheep in wolves’
clothing are being unmasked thanks to one of
the largest genetics studies ever conducted: the
Exome Aggregation Consortium, or ExAC.

ExAC is a simple idea. It combines sequences
for the protein-coding region of the genome —
the exome — from more than 60,000 people
into one database, allowing scientists to compare
them and understand how variable they
are. But the resource is having tremendous
impacts in biomedical research. As well as helping
scientists to toss out spurious disease-gene
links, it is generating new discoveries. By looking
more closely at the frequency of mutations
in different populations, researchers can gain
insight into what many genes do and how their
protein products function.

ExAC has turned human genetics upside
down, says geneticist David Goldstein of
Columbia University in New York City. Instead
of starting with a disease or trait and working
backwards to find its genetic underpinnings,
researchers can start with mutations that look
like they should have an interesting effect and
investigate what might be happening in the people
who harbour them. “This really is a new way
of working,” he says.

ExAC is also providing better information
for families facing genetic diagnoses. D178N,
for example, was strongly suspected of causing
prion disease because it had been seen in
several people with the condition and seldom
elsewhere. But before ExAC, no one really had
the power to see just how rare it was. If it shows
up in people more frequently than prion disease
does, that would mean Vallabh’s risk of getting
the disease is much lower than predicted.

“We needed to find out if this mutation
had ever been seen in a healthy population,”
Minikel says.


DATA GATHERING 
ExAC was born of frustration. In 2012, geneticist
Daniel MacArthur was starting his first
laboratory, at Massachusetts General Hospital
(MGH) in Boston. He wanted to find genetic
mutations that caused rare muscle diseases,
and needed two things: genome sequences
from people with these disorders, and genome
sequences from people without them. If a mutation
was more common in people with a disorder
than in healthy controls, it stood to reason
that the mutation was a likely cause.

The problem was that MacArthur couldn't
find enough sequences from unaffected
people. He needed lots of exomes, and although
researchers had been sequencing them by the
thousands, existing data sets weren't large
enough. No one had pulled enough together
into one combined, standardized resource.

So MacArthur started asking his colleagues
to share their data with him. He was well suited
to the task: an early adopter of social media, his
lively blog posts and acerbic Twitter feed had
made him unusually popular and authoritative
for a young scientist. He also had a position
with the Broad Institute in Cambridge, Massachusetts,
a genome-sequencing powerhouse.
MacArthur convinced researchers to share data
from tens of thousands of exomes with him;
most were in some way connected to the Broad.

Daniel MacArthur convinced researchers to share genetic data on tens of thousands of people.

All that remained was to analyse the data, but
that was no trivial task. Although the genes had
been sequenced, the raw data had been analysed
using different types of software — including
some that were out of date. If one individual in
the collection showed a rare mutation, it could
be real — or it could be an artefact of how different
programs ‘called’ the bases within, judging
whether they were As, Cs, Ts or Gs. MacArthur
needed something that would standardize this
gigantic data set. The Broad had developed
genome-calling software, but it wasn't up to
the task of churning through the tremendous
amount of data included in ExAC. So MacArthur’s
team worked closely with the Broad programmers
to test the software and scale up its
abilities. “That was a pretty horrific 18 months,”
MacArthur recalls. “We ran into every obstacle
imaginable and had nothing to show for it.”


PERSONAL STAKE 
While this was going on, in April 2013, Vallabh
was learning how to work with stem cells at
MGH while Minikel studied bioinformatics.
Minikel met MacArthur for lunch and
explained his and Vallabh’s curiosity about
whether D178N existed in healthy people.
He admits to being a bit star-struck by MacArthur's
reputation. “I thought if I could get
him to think about my problem for half an
hour, that would probably be the most important
thing that happened in my whole month,”
Minikel says. The pair went upstairs to MacArthur’s
lab, where bioinformatician Monkol
Lek ran a search on the ExAC data that had
been analysed so far — about 20,000 exomes.
They didn't see Vallabh’s mutation. That wasn't
good news, but, optimistic about exploring the
data further, Minikel joined MacArthur’s lab.

By June 2014, MacArthur's team and its
collaborators had a data set that they were
confident in — exomes from 60,706 individuals
representing various ethnic groups,
who met certain thresholds for health and
consent. They released ExAC that October at
the annual meeting of the American Society
of Human Genetics (ASHG), in San Diego,
California. Immediately, researchers and physicians
recognized that the data could help to
recast their understanding of genetic risks.
Many disease-association studies, particularly
in recent years, have identified mutations
as pathogenic simply because scientists performing
analyses on a group of people with a
disorder found mutations that looked like the
culprit, but didn't see them in healthy people.
But it’s possible that they weren't looking hard
enough, or in the right populations. Baseline
‘healthy’ genetic data has tended to come
mainly from people of European descent,
which can skew results.

In August this year, MacArthur's group
published its analysis of ExAC data in Nature,
revealing that many mutations thought to be
harmful are probably not. In one analysis,
the group identified 192 variants that had
previously been thought to be pathogenic,
but turned out to be relatively common. The
scientists reviewed papers about these variants,
looking for plausible evidence that they actually
caused disease, but could find solid evidence
for only nine of them. Most are actually benign,
according to standards set by the American
College of Medical Genetics and Genomics,
and many have now been reclassified as such.

Similar work promises to have direct impacts
on medical practice. In a companion paper,
geneticist Hugh Watkins of the University of
Oxford, UK, looked at genes associated with
certain types of cardiomyopathy that cause
gradual weakening of the heart muscle. Undetected,
they can lead to sudden death, and it
has become fairly common to check relatives
of people with the conditions for genetic mutations
associated with them. Those found to have
a genetic risk are sometimes counselled to get an
implanted defibrillator, which delivers electrical
shocks to the heart if it seems to be beating
abnormally. Watkins checked the ExAC database
for information on genes that have been
associated with these heart conditions, and
found that many mutations are much too common
among healthy people to be pathogenic.
About 60 genes had been implicated as harbouring
pathogenic mutations that cause one form of
the disease; Watkins’ analysis revealed that 40 of
these probably bear no link.

This was troubling. “If you have a genetic risk
that you believe is predicting disease but isn’t,
you can end up doing drastic things that can
harm someone,” says Watkins.

Even some of the mutations that seem to be
reliably linked to disease aren’t a sure bet — such
as those in PRNP. There are definitely mutations
in the gene that cause the disease, but some
variants might not be pathogenic or
might elevate the risk only slightly
(see ‘The deadly mutations that
weren't’). To find out the status of
D178N, Vallabh and Minikel gathered
genetic data from more than
16,000 people who had been diagnosed
with prion diseases, and compared
them with data from almost
600,000 others, including the ExAC
participants.

The pair found that 52 people in
ExAC had PRNP mutations that
have been linked to prion diseases,
but based on the prevalence of the
disease, they would have expected
to see maybe two. Minikel calculated
that some of these supposedly lethal
mutations elevated a person’s risk of
prion disease slightly; some seemed
not to be linked to prion disease at all.
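
The gap between 52 observed carriers and the roughly two expected can be made concrete with a small calculation. The sketch below is only an illustration: treating the carrier count as Poisson-distributed is an assumption made here, not the method Minikel and Vallabh describe, and the function name `poisson_tail` is hypothetical. Only the numbers 52 and "maybe two" come from the article.

```python
import math


def poisson_tail(observed: int, expected: float, extra_terms: int = 200) -> float:
    """Approximate P(X >= observed) for X ~ Poisson(expected).

    The tail is summed directly, starting at `observed`, so that very small
    probabilities are not lost to rounding (as they would be with 1 - CDF).
    """
    # First term of the tail, computed in log space to avoid overflow in k!.
    log_term = -expected + observed * math.log(expected) - math.lgamma(observed + 1)
    term = math.exp(log_term)
    total = 0.0
    k = observed
    for _ in range(extra_terms):
        total += term
        k += 1
        term *= expected / k  # pmf(k) = pmf(k - 1) * expected / k
    return total


observed_carriers = 52    # carriers of disease-linked PRNP variants seen in ExAC (from the article)
expected_carriers = 2.0   # "maybe two" expected from disease prevalence (from the article)

fold_excess = observed_carriers / expected_carriers
p_value = poisson_tail(observed_carriers, expected_carriers)
print(f"fold excess: {fold_excess:.0f}x, tail probability: {p_value:.1e}")
```

Under that assumption, an excess this large is far too unlikely to be chance, which is consistent with the article's conclusion that at least some of the variants cannot be fully penetrant.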

This work provided insight for 
people such as Alice Uflacker. In 
2011, Uflacker’s father, Renan, died 
from Creutzfeldt-Jakob disease, a 
prion illness that causes rapid mental
and physical deterioration. He was
62. Alice found out that she carried
a mutation in PRNP called V210I,
which had been linked to her father's 
disease in previous studies. Three 
years later, she learned from Minikel 
that the mutation confers, at most, a 
small risk of disease. The informa- 
tion was helpful, and the result made 
sense; her grandmother had lived to 
93 despite having the same mutation. 

Vallabh and Minikel would find
no such relief, however. D178N was
absent from the other genomes they
looked at, and is still highly likely
to cause prion disease. Minikel and
Vallabh had already begun to suspect
as much, as Minikel dug into the data.
“All along the way was gradual confirmation
of what we were assuming
anyway,” Minikel says. “There wasn’t any
moment where we said, ‘Ah, this is the worst
news.’ We’d already gotten the worst news.”


THE DEADLY MUTATIONS THAT WEREN'T
Prion diseases are rare neurodegenerative disorders caused by misfolded
prion proteins. About 63 mutations in the gene PRNP have been linked to
them. But until now it has been difficult to estimate how likely it is that a
given variant will result in disease, a measure known as penetrance. Data
compiled by the Exome Aggregation Consortium (ExAC) can help.


HUMAN KNOCKOUTS
ExAC is revealing a lot about genes through
the frequency of mutations. MacArthur and
his team found 3,200 genes that are almost
never severely mutated in any of the ExAC
genomes — a signal that these genes are important.
And yet 72% of them have never before
been linked to disease. Researchers are eager
to study whether some of these genes play
unappreciated parts in illness.

Conversely, the group has found nearly
180,000 instances of mutations so severe
that they should render their protein products
completely inactive. Scientists have long
studied genes by knocking them out in animals
such as mice, so that they don't work. By
looking at the symptoms that develop, they can
study what the genes do. But that has never
been possible in humans. Now, researchers are
eager to study these natural human knockouts
to understand what they can reveal about how
diseases develop or may be cured. MacArthur
and other researchers are gearing up to prioritize
which human knockout genes to study
and how best to contact the people carrying
them for further study.

But it will have to wait until he completes
the second phase of ExAC. Due to be unveiled
at the ASHG meeting in Vancouver, Canada,
this month, it will double the data set’s size
to 135,000 exomes and include some 15,000
whole-genome sequences, which should allow
researchers to explore mutations in regulatory
regions of the genome that are not captured by
exome sequencing.

ExAC is quietly becoming a standard tool in
medical genetics. Clinical labs around
the world now check it before telling a
patient that a particular glitch in their
genome might be making them ill. If
the mutation is common in ExAC,
it's unlikely to be harmful. Geneticist
Leslie Biesecker at the US National
Human Genome Research Institute
in Bethesda, Maryland, says that his
lab uses ExAC daily in patient care.
“It’s a critical factor that we take into
consideration for every variant,” he
says. He and other geneticists are now
embarking on a painstaking reckoning
with the genetics literature that
will probably take years.

ExAC has also driven home a point
that Goldstein and other researchers
have made repeatedly: that failing
to include people from Asian,
African, Latino and other non-European
ancestries is holding back
understanding of how genes influence
disease by limiting the view of
human genetic diversity (see page
161). There is now a fresh impetus to
include under-represented groups in
planned studies linking genetics and
health information on large numbers
of people, such as the US Precision
Medicine Initiative.

For Vallabh and Minikel, ExAC
provided a disheartening confirmation,
but also some promising insight.
Minikel's studies have identified
three people in ExAC with mutations
that should silence one of the two
copies of the prion protein gene. If
they can live with a limited amount of
functioning protein, perhaps a drug
could be made that would silence
the defective protein in Vallabh, preventing
prion aggregation and disease
progression without dangerous side
effects. Minikel got in touch with one
of the individuals, a man in Sweden,
who agreed to donate some cells for research.
Minikel and Vallabh have now joined the lab of
biochemist Stuart Schreiber at the Broad Institute,
where they are working full-time to find
candidate drugs to treat prion disease.

The couple exemplifies the challenge of
translating ExAC data into real medical benefits.
“We can't go back from this,” Vallabh says.
“We have to go through it.” Their situation
couldn't be more illustrative of what is at stake:
Vallabh is now 32 — just 20 years younger than
her mother was when she died. She has no time
to waste.

SEE EDITORIAL P40


Erika Check Hayden writes for Nature from
San Francisco.


The troubled minds of migrants


THE REFUGEES AND MIGRANTS SURGING INTO EUROPE ARE SUFFERING VERY HIGH 
LEVELS OF PSYCHIATRIC DISORDERS. RESEARCHERS ARE STRUGGLING TO HELP. 


By Alison Abbott 


On an ice-cold day in January, clinical psychologist
Emily Holmes picked up a stack of empty
diaries and went down to Stockholm’s central
train station in search of refugees. She didn't
have to look hard. Crowds of lost-looking
young people were milling around the concourse,
in clothes too flimsy for the freezing
air. “It struck me hard to see how thin some of
the young men were,” she says.

Holmes, who works at Stockholm’s Karolinska
Institute, was seeking help with her
research — a pilot project on post-traumatic
stress disorder (PTSD), which is all too common
in refugees. She wanted to see whether
they would be willing to spend a week noting
down any flashbacks — fragmented memories
of a trauma that rush unbidden into the
mind and torment those with PTSD. She easily
found volunteers. And when they returned the diaries, Holmes was
shocked to see that they reported an average of two a day — many more
than the PTSD sufferers she routinely dealt with. “My heart went out
to them,” she says. “They managed to travel thousands of kilometres to
find their way to safety with this level of symptoms.”

Europe is experiencing the largest movement of people since the 
Second World War. Last year, more than 1.2 million people applied for 
asylum in the European Union — and those numbers underestimate 
the scale of the problem. Germany, which has taken in the lion’s share 
of people, reckons that it received more than a million refugees in 2015, 
tens of thousands of whom have yet to officially apply for asylum. Most
came from Syria, Afghanistan and Iraq. Many have experienced war, 
shock, upheaval and terrible journeys, and they often have poor physical 
health. The crisis has attracted global attention and sparked political 
tension as countries struggle to accommodate and integrate the influx. 

What hasn't been widely discussed is the enormous burden of
mental-health disorders in migrants and refugees. Clinical psychologist
Thomas Elbert from the University of Konstanz in Germany is
conducting a local survey of refugees that
suggests “more than half of those who
arrived in Germany in the last few years
show signs of mental disorder, and a quarter
of them have a PTSD, anxiety or depression that won’t get better
without help”. Previous research shows that refugees and migrants are
also at a slightly increased risk of developing schizophrenia.

Arrival in a foreign, hostile country causes many refugees great stress.

“It is a public-health tragedy — and it’s a scandal that it is not recognized
as such, as a physical epidemic would be,” says epidemiologist
James Kirkbride of University College London.

Doctors and researchers are starting to take action. Holmes and 
other psychologists and psychiatrists are working with refugees to 
develop practical, cheap and effective therapies for trauma-related dis- 
orders — therapies that could be quickly deployed on this group. Other 
scientists want to work with local refugees to understand more about 
how the different types of stresses they suffer play out in their brains, and 
to learn more about the basic biology of psychiatric disorders. 

Scientists hope that their studies will help them to deal with other 
displaced populations, and help policymakers to accommodate the cur- 
rent influx. Politicians have been too slow to consider mental health 
when they call for refugees to integrate quickly, Elbert says. “It is illusory
to think that people can learn a new language and find work when they 
can't function properly mentally. If we want quick integration, we need 
an immediate plan for mental health.” 


MAKING A NEW LIFE 

Amira is a clinical psychologist and a refugee from Syria. When the war
there started, she worked in camps for Syrian refugees in Jordan. She saw
people who had been physically attacked, women who had been raped
and children who had been neglected. The symptoms of PTSD were
clear, and she knows that many refugees have depression and anxiety,
too. She asked that her real name not be used.

She arrived in Sweden at the end of December 2015, and wanted to
help other refugees but was not allowed to work at first. She tried to
make contacts in Stockholm and joined a language course for refugees;
she felt very alone but carried on. Now, she has a 6-month position.
“I met many children who have experienced war,” she says. “We feel sad,
[about] how our children think and how they feel. I have a child and
I try to protect him.”

Researchers already have a wealth of evidence about the mental 
health of migrant and refugee populations around the world. (The 
United Nations defines refugees as people fleeing armed conflict or 
persecution and migrants as people who choose to move to improve 
their lives. Asylum seekers are those seeking official refugee status; but 
sometimes different definitions are used.) A 2005 meta-analysis of stud- 
ies performed mostly in northern Europe showed that first- and second- 
generation migrants were at much greater risk of schizophrenia than 
non-migrants — and that those from developing countries were more 
at risk than those from developed ones.

A large cohort study published in March looked at 1.3 million people
who had arrived in Sweden before 2011 (see ‘Migrant crisis’). Refugees
had a threefold higher incidence of schizophrenia and other psychotic
disorders than native-born Swedes, and a 66% higher incidence than
migrants who were not refugees. (The overall risk for refugees and
migrants still remains comparatively low, at perhaps 2–3%.) Kirkbride,
an author on the study, says that his team’s more recent analysis of UK 
migration data suggests that the level of increased risk of psychotic dis- 
orders may depend on how old people were when they migrated — with 
children potentially at greater risk. 

Those who stand out most seem to be particularly vulnerable. The
2005 meta-analysis showed that black migrants in a mostly white population
had an almost fivefold increased risk of psychotic disorders. And
the risk is higher for migrants living in neighbourhoods with a low
proportion of residents from their own ethnic group compared with
those surrounded by many of their own ethnicity.

Psychiatrist Andreas Meyer-Lindenberg from the Central Institute
for Mental Health in Mannheim, Germany, is one of those trying to
understand the brain mechanisms involved. He has already studied
other populations with an above-average risk of psychosis, such as city
dwellers and ethnic minorities. The work suggested that the brains
of these people are overly sensitive to social stress, such as a stream of
disapproving feedback.

CORRECTED 24 OCTOBER 2016 | 13 OCTOBER 2016 | VOL 538

Thanks to a grant awarded last month from the state government of
Baden-Württemberg, Meyer-Lindenberg plans to extend his studies
by recruiting 200 refugees and 200 people from the local community.
The refugees will use smartphones to note their state of mind — such
as feelings of suspiciousness — and they will later receive brain scans.
The eventual aim is to find patterns in the data that indicate people
with abnormal processing of social stress, who may therefore be
at increased risk of mental illness. Jean-Paul Selten, a psychiatrist at
Maastricht University in the Netherlands, is also exploring the poisonous
nature of social stress. He proposes that pressures such as social exclusion
raise the risk of psychosis by changing the brain’s sensitivity to the
neurotransmitter dopamine.

Germany's integration plans, consolidated in a law that came into 
force in August, involve distributing refugees across the country to avoid 
the creation of large, isolated ethnic communities. That could be problematic
if it increases people’s isolation, but Meyer-Lindenberg says it’s
“actually a good policy” — because other people in the community get 
to know refugees, and this usually reduces xenophobia, another major 
source of social stress. 


Politicians consider integration to be essential for security, among 
other things. A handful of terrorist attacks in Europe over the past two 
years have been carried out by refugees or others with a migrant back- 
ground who had been known to have a history of psychiatric problems. 
But doctors and researchers are extremely wary about making a link 
between refugees or migrants and terrorist acts, pointing out that very 
few of those with mental-health problems become violent, regardless of 
their origins. The security concern simply accentuates the need to help 
all those with mental-health problems across the population, they say.


THE STRESS OF UPHEAVAL
Psychologists recognize three windows of extreme stress for refugees:
the often violent traumas in their home countries that led to their flight;
the journey itself; and the arrival, when people are thrust into a foreign
country. “The latter ‘post-migration’ phase is becoming increasingly
important,” says psychiatrist Malek Bajbouj from the Charité university
hospital in Berlin. “Suddenly they realize they have lost everything, have
no control over aspects of their lives and no social standing.”

In February, Bajbouj — who is of Syrian descent and speaks 
Arabic — and two colleagues from other departments opened a clearing 
centre for refugees with mental-health problems, the first ofits kind in 
Germany. It a quiet building, a former hospital in the centre of Ber- 
Jin — but already 1,500 troubled people have passed through its doors. 
“Refugees may arrive in Germany with great hope, but then find them- 
selves stuck for months in camps with no apparent prospects,” he says. 
“When we ask them what their greatest stressors are, they typically refer 
not to their traumatic memories, but to their current frustrations.”

The biggest challenge for Bajbouj and others is the sheer volume of 
people in need of help. They must be assisted quickly and cheaply, in 
ways that take the pressure off overstretched health professionals. At the 
clearing centre, three psychiatrists assess visitors rapidly, categorizing 
them into those who require low-level or more-intensive psychiatric 
help and those who can be aided by social workers. A lot of effort goes 
into teaching about stress management and the science behind mental
health. “Some people from rural areas hold the Djinn responsible
for their moods,” says Bajbouj, “and we teach them that symptoms like
sleeplessness and depression are biologically based and can be treated.”

Elbert wants to see similar triage systems put in place across Germany.
In a paper to be published next month, he and a group of colleagues
call for a three-tiered approach. Refugees would initially be helped by 
bilingual laypeople — ideally migrants or refugees themselves — who 
are trained to guide people through the German health system (tier one) 
or to offer trauma counselling (tier two). Those in most need would 
progress to tier three: qualified psychologists or psychiatrists. 


PEACE OF MIND 

Training laypeople seems to work in emergency situations. Elbert,
together with Sarah Ayoughi, a clinical psychologist from the psycho- 
social-care organization Ipso, carried out a randomized controlled study 
of people with mental-health conditions in north Afghanistan who 
received psychosocial counselling — a type of talking therapy — con- 
ducted by local physicians who had no previous education in psychology 
or psychiatry, but who were specially trained for the trial. Just 5–8 sessions
improved symptoms of depression and anxiety for up to 3 months⁷.

And several studies, including a 2011 randomized controlled trial of
former child soldiers in northern Uganda⁸, show that an approach called
narrative exposure therapy (NET), carried out by trained lay counsellors,
can reduce the severity of PTSD symptoms. Elbert started developing
NET with his wife Maggie Schauer, also a clinical psychologist at
the University of Konstanz, when they were working with refugees in
Kosovo in the late 1990s. It exploits new understanding of how memories
are linked with fear circuitry in the brain. A traumatized person
works with a therapist or counsellor to construct a narrative of their lives
and anchor their traumatic experiences in the correct time and place.

Pragmatic as the three-tiered approach may sound, it won't be 
simple to introduce in Germany. Professional associations are resistant
to allowing people without formal qualifications to help out with
psychotherapies, and various regulations could get in the way. 

But while the federal government ponders what to do, some 
programmes are starting up with regional government support. Schauer 
has received €100,000 (US$112,000) to test whether NET works as well 
on refugees in Germany as it has in war-torn countries. And Ayoughi is
organizing the training of refugees in Erfurt in Germany, with additional
support from the Google Foundation.

Bajbouj thinks that the political desire to get refugees into the 
workforce fast may end up easing the way for more relaxed rules about 
psychotherapy. And there is another way to deliver inexpensive
mental-health care: through the Internet and apps. He is developing an
Arabic-language version of the smartphone app PTSD Coach, which
provides education, a personalized emergency plan, self-assessment
and 25 different techniques to regulate stress. He is testing it in the Arab
Outpatient Centre he opened at the Charité in 2008.

In Stockholm, Holmes also hopes that technology can help. The aim 
of her work is to test whether it's possible to subdue PTSD-linked emotional
flashbacks if a person immediately plays a video game on their
phone that competes for cognitive space in the brain — a technique that
she has seen work in laboratory tests⁹. “The important thing now is to
develop simple new approaches to therapy that can be scaled up, and to 
prove that they help,” she says. 

Sweden, which has taken in a relatively large number of refugees, 
is also starting mental-health programmes. Early this year, local 
authorities rolled out a plan to make it easier for refugees to access support:
health checks will include more questions about states of mind,
and those recognized as being in need will be channelled towards 
psychological or psychiatric support. 

The flow of refugees and migrants has eased this year, in part because
Turkey has agreed to take back those who illegally entered EU countries
from there. But people keep coming. This August, more than 18,000
refugees entered Germany to seek asylum. And even if the current
crisis eases, conflict, poverty, natural catastrophe and climate change
will inevitably drive fresh waves of migration around the world. “We’ve
learnt lessons about mental health from crises in war-torn countries,”
says Ayoughi, “and we can apply these in the refugee crisis in Europe
now if we get the support.” Then, perhaps, lessons learnt in Europe could
feed back to war zones.

Bajbouj has been calling for a ‘migration think-tank’, a permanent
institution in Germany where scientists of different disciplines can come
together to work out what needs to be done. “The challenges are not
just about mental health, but about education, integration into the
workforce and much more,” he says. “But mental health impacts everything.”


Alison Abbott is Nature’s senior European correspondent.


1. Cantor-Graae, E. & Selten, J.-P. Am. J. Psychiatry 162, 12–24 (2005).
2. Hollander, A.-C. et al. Br. Med. J. 352, i1030 (2016).
3. Kirkbride, J. B., Jones, P. B., Ullrich, S. & Coid, J. W. Schizophr. Bull. 40, 169–180 (2014).
4. Lederbogen, F. et al. Nature 474, 498–501 (2011).
5. Akdeniz, C. et al. JAMA Psychiatry 71, 672–680 (2014).
6. Selten, J.-P., van der Ven, E., Rutten, B. P. F. & Cantor-Graae, E. Schizophr. Bull. 39, 1180–1186 (2013).
7. Ayoughi, S., Missmahl, I., Weierstall, R. & Elbert, T. BMC Psychiatry 12, 14 (2012).
8. Ertl, V., Pfeiffer, A., Schauer, E., Elbert, T. & Neuner, F. JAMA 306, 503–512 (2011).
9. James, E. L. et al. Psychol. Sci. 26, 1201–1215 (2015).




COMMENT 


CITIES To inform policy, urban scholarship must get organized and funded
HISTORY A biography of Enrico Fermi, Italy’s fallible atomic physicist
POLITICS The causes Einstein championed offer a window on his time
OBITUARY Roger Yonchien Tsien, fluorescent-biology pioneer, remembered




Genomics is 
failing on diversity 


An analysis by Alice B. Popejoy and Stephanie M. Fullerton indicates that some 
populations are still being left behind on the road to precision medicine. 


A 2009 analysis revealed that 96% of
participants in genome-wide association
studies (GWAS) were of
European descent. Such studies scan the
genomes of thousands of people to find variants
associated with disease traits. The finding
prompted warnings that a much broader
range of populations should be investigated
to avoid genomic medicine being of benefit
merely to “a privileged few”.

Seven years on, we've updated that
analysis. Our findings indicate that the
proportion of individuals included in
GWAS who are not of European descent
has increased to nearly 20%. Much of this
rise, however, is a result of more studies
being done in Asia on populations of Asian
ancestry. The degree to which people of African
and Latin American ancestry, Hispanic
people and indigenous peoples are represented
in GWAS has barely shifted.

Thus, more than 20 years after the
US National Institutes of Health (NIH)
mandated the inclusion of diverse participants
in the biomedical research it funds,
GWAS funded by the NIH and other sources
are continuing to miss a vast portion of the
world’s genetic variation.

Over the past decade, GWAS have been
the preferred tool for discovering the genetic
factors involved in common diseases. Tens of
thousands of significant associations between
genetic variants and biological traits have
now been found, and many of these associations
have helped geneticists to uncover
biological mechanisms underpinning conditions
from diabetes to schizophrenia.

PERSISTENT BIAS

Over the past seven years, the proportion of participants in genome-wide
association studies (GWAS) that are of Asian ancestry has increased.
Groups of other ancestries continue to be very poorly represented.

[Figure: ancestry of GWAS participants in 2009 (373 studies, 1.7 million samples; 96% European ancestry, 4% non-European ancestry) and in 2016 (2,511 studies, 35 million samples), with a breakdown of non-European ancestry samples into Asian, African, mixed, Hispanic and Latin American, Pacific Islander, Arab and Middle Eastern, and native peoples.]

162 | NATURE | VOL 538 | 13 OCTOBER 2016

The most comprehensive, publicly accessible
summary of human genetic association
research is the GWAS Catalog (www.ebi.ac.uk/gwas)
produced by the US National
Human Genome Research Institute in partnership
with the European Bioinformatics
Institute. Every week, the curators of the 
catalogue receive automatic alerts of any 
new English-language GWAS reported in 
PubMed. These studies are then put through 
two rounds of data extraction and validation 
before being added to the catalogue. Among 
the data extracted from each study are the 
race, ethnicity or ancestry (as described 
by the authors of the study) of the subjects 
whose samples were analysed. 


DATA GATHERING 

To determine ancestry, we analysed the sam- 
ple descriptions included in the GWAS Cat- 
alog with an approach similar to that used 
in 2009 (see Supplementary Information; 
go.nature.com/2dv2faf). 

As of August, 2,511 studies involving nearly
35 million samples were included in the 
GWAS Catalog. This is a more than 2,000% 
increase in sample number from the 2009 
analysis (which looked at roughly 1.7 million 
samples across 373 independent studies).

We found considerable heterogeneity in
descriptions. For example, 26 terms, including
‘black cases’ and ‘sub-Saharan African’,
were used to describe people of African
ancestry. The most geographically specific
and informative descriptions were those
used for samples of European origin, as
previous studies have shown.
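
To make the tallying step concrete, here is a minimal Python sketch of how free-text sample descriptions could be grouped into broad ancestry categories and converted to percentage shares. It is an illustration under stated assumptions, not the authors' method (which is described in their Supplementary Information): the keyword map, the function names `categorize` and `ancestry_shares`, and the sample counts are all hypothetical. Only 'black cases' and 'sub-Saharan African' are terms quoted in the text.

```python
from collections import Counter

# Hypothetical keyword map: only 'black cases' and 'sub-saharan african'
# come from the article; every other term is an illustrative assumption.
CATEGORY_KEYWORDS = {
    "African": ["black cases", "sub-saharan african", "african american"],
    "Asian": ["japanese", "han chinese", "korean"],
    "European": ["european", "finnish", "british"],
}


def categorize(description: str) -> str:
    """Map one free-text sample description to a broad ancestry category."""
    text = description.lower()
    for category, keywords in CATEGORY_KEYWORDS.items():
        if any(keyword in text for keyword in keywords):
            return category
    return "Unclassified"


def ancestry_shares(samples):
    """Return each category's percentage of the total sample count.

    `samples` is an iterable of (description, sample_count) pairs.
    """
    totals = Counter()
    for description, count in samples:
        totals[categorize(description)] += count
    grand_total = sum(totals.values())
    return {cat: 100 * n / grand_total for cat, n in totals.items()}


# Made-up descriptions and counts, for illustration only.
example = [
    ("1,200 Black cases", 1200),
    ("Sub-Saharan African controls", 800),
    ("Han Chinese individuals", 3000),
    ("European ancestry cases", 15000),
]
shares = ancestry_shares(example)
```

A keyword map like this is deliberately crude. With 26 different terms in use for African ancestry alone, a real analysis would need a curated vocabulary and manual review of any description that fails to match.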

During the past seven years, the propor- 
tion of samples used in catalogued GWAS 
from participants who are not of European 
descent has increased fivefold (see ‘Persistent
bias’). Yet nearly 78% of this growth is due
to an increase in the number of samples 
from Japan, China, Korea, India and other 
populations from east Asia, south Asia and 
southeast Asia. 

Together, individuals of African and Latin
American ancestry, Hispanic people (indi- 
viduals descended from Spanish-speaking 
cultures in central or South America living 
in the United States) and native or indig- 
enous peoples represent less than 4% of all 
samples analysed. Collectively, these are the 
most vulnerable and traditionally under- 
served populations in many of the world’s 
richest nations. 

The proportion of samples from individu- 
als of African ancestry has increased by 2.5%, 
and the proportion of people of Hispanic or 
of Latin American ancestry by around 0.5%. 
In the case of indigenous peoples (including 
Native Americans, Australian Aboriginals
and Pacific Islanders), representation has
decreased slightly since 2009. 

By looking up GWAS involving only Asian 
participants in PubMed (349 studies), we 
found the institution of the first author of 
each study. Around 93% of these studies 
were conducted in Asian countries. That 
the number of GWAS involving local populations
has risen so much in Asia is heartening.
But with such a large increase overall in
the number of GWAS performed in the past
seven years, the lack of growth in representation
from other populations is remarkable
and deeply disconcerting.

Of course, our analysis does not account for
the resampling of data sets across independent
studies. Information from some cohorts
in publicly available databases has been
used multiple times for different GWAS (see
Supplementary Information). So the numerous
samples of European ancestry used in
GWAS could come from a smaller number
of actual individuals. Yet if European-ancestry
data sets are resampled more often than others,
this in itself reflects population-specific
differences in research effort.


WHY THE BIAS? 
The continuing European bias in GWAS is
likely to be the result of logistical, systemic 
and historical factors. 

The more populations that are included in
a study, the more variables there are to control
for. In trying to keep things as simple as pos- 
sible, geneticists probably favour the use of 
existing cohorts, such as that of the Framing- 
ham Heart Study, or other large data sets gen- 
erated by well-established medical centres. 

Such organizations collect samples and 
information from people in the same geo- 
graphic location, who are presumed to be 
exposed to shared environmental factors, 
using uniform collection practices. But for
various reasons, some populations are easily
bypassed. People may have limited access
to certain medical centres, for example, or, 
for cultural or historical reasons, elect not to 
contribute their samples to research. 

Genotype and phenotype informa- 
tion from diverse populations is avail- 
able. Researchers using NIH funding are 
required to submit any such information 
they have collected to dbGaP, a public data- 
base of genotypes and phenotypes. Analo- 
gous recommendations are made by other 
major biomedical funders outside the United 
States. In Europe, geneticists are encouraged 
to share similar data through the European 
Genome-phenome Archive (EGA). Yet for 
various reasons (such as the difficulties of 
getting certain kinds of studies funded, a 
preference for larger sample sizes, a percep- 
tion that the analysis will be simplified by 
using data from one ancestry group or lack 
of awareness of the diversity of data sets available)
geneticists seem to be preferentially
using cohorts of European ancestry.

A study of Greenlandic Inuits revealed a previously missed genetic variant associated with height.

Repeated sampling is almost certainly 
exacerbating the problem. Indeed, to some 
degree, the over-representation of people of 
European ancestry in GWAS may be a legacy
of earlier biases. 


WHAT'S MISSED 
Irrespective of what's driving it, the continued
under-representation of populations of
mixed ancestry or of people whose ancestry
is not European is a problem.

Until they are able to conduct amply
powered GWAS on each major ancestral
population across the world, geneticists
will continue to miss important information
about disease biology. They won't
know how many of the thousands of associations
between variants and diseases, and
between variants and responses to drugs,
observed in populations of European
ancestry replicate in other groups. And
opportunities will be missed to discover
new associations with disease traits in other
populations.

“European ancestry in GWAS may be a legacy of earlier biases.”

For example, for 25% of the variants in
European Americans that GWAS have identified
as being associated with body mass index,
type 2 diabetes and lipid levels, the strength of
the association differs in at least one out of five
populations of non-European ancestry. This
means that a variant that is associated with
diabetes may confer a different risk of disease
in someone of European ancestry than in, say,
an individual of African ancestry.

Likewise, population-specific differences 
in the frequencies of variants associated 
with drug metabolism may mean that cer- 
tain drugs will be safer and more effective 
in some populations than in others. The 
CYP2D6 gene, for instance, is involved in the 
metabolism of many commonly prescribed 
drugs, including tamoxifen, which is used 
to treat breast cancer. More than 100 different
variants of this gene (alleles) — many of
which affect an individual's ability to safely
digest and use a drug — occur at different
frequencies in different populations. 

Several associations between drug 
responses and clinically relevant genetic 
variants have already been identified with 
GWAS. In some cases in which the effect 
sizes are large, significant results have 
been found with as few as 51 cases and 282 
controls. (In this case, patients had different
reactions to the lipid-lowering drug
simvastatin.) Although physicians must 
weigh the costs and benefits of using phar- 
macogenetic testing to guide prescription 
and dosage decisions for individual patients, 
these findings suggest that the small samples 
that have already been collected from under- 
represented populations could yield leads 
that have not been identified in populations 
of European ancestry. 

Conducting analyses in other populations 
is also crucial for assessing the accuracy and 
broader relevance of a finding. It is possible,
for example, that associations between certain
disease traits and variants found in European 
populations that cannot be replicated in other 
populations are actually false positives. In 
fact, the analysis of a broader representation 
of populations can reveal insights that would 
have otherwise been missed. 

A genome-wide scan in a Greenlandic
Inuit population, for example, found last
year that a single-nucleotide polymorphism
(SNP) in a fatty-acid enzyme affects height
in both this population and Europeans. The
authors suggest that previous GWAS may
have missed this variant because of its low frequency
in Europeans (0.017 compared to 0.98
in the Greenlandic Inuit population) — even
though it has a much greater effect on height 
than others previously identified through 
GWAS. 


NEW DIRECTIONS 
Increasingly, the sequencing of whole
genomes and whole exomes (that is, the complete
set of protein-coding genes) is beginning
to be used more widely for discovery
as costs fall. These may prove more fruitful
than GWAS for individual-level diagnosis and
treatment. Certainly, they are better suited to 
revealing rare variations that are clinically 
informative. (GWAS identify known genetic 
markers associated with a trait, but not neces- 
sarily the mutations that cause the disease.) 
Studies that use these new approaches have 
been slightly more successful than GWAS at 
recruiting a greater diversity of populations. 
For example, the international Exome Aggre- 
gation Consortium hosts data on genetic 
variants from more than 60,000 samples, of 
which 8.6% are from people of African ances- 
try, 9.5% are from people of Latin American 
ancestry, and 60.4% are from people of European
ancestry (see page 154). The remaining
samples (21.5%) are from south Asia, east
Asia and the Middle East. Similarly, the Trans- 
Omics for Precision Medicine whole-genome 
sequencing project of the US National Heart, 
Lung and Blood Institute is growing and cur- 
rently holds 62,000 samples, of which 50% 
are from European Americans, 30% are from 
African Americans, 10% are from Hispanics 
or Latin Americans, and 8% are from Asians. 
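
As a quick arithmetic check on these figures, the short Python sketch below sums the quoted shares for each resource and reports what is left over. The dictionary and function names are illustrative assumptions; the percentages are those given in the text.

```python
# Shares as quoted in the text (per cent of samples in each resource).
exac = {
    "African": 8.6,
    "Latin American": 9.5,
    "European": 60.4,
    "South Asian, East Asian and Middle Eastern": 21.5,
}
topmed = {
    "European American": 50,
    "African American": 30,
    "Hispanic or Latin American": 10,
    "Asian": 8,
}


def unaccounted(shares):
    """Percentage of samples not covered by the listed groups."""
    return round(100 - sum(shares.values()), 1)


def non_european(shares, european_key):
    """Percentage of samples outside the named European-ancestry group."""
    return round(100 - shares[european_key], 1)


exac_gap = unaccounted(exac)
topmed_gap = unaccounted(topmed)
exac_non_european = non_european(exac, "European")
topmed_non_european = non_european(topmed, "European American")
```

On these numbers the Exome Aggregation Consortium shares add up to 100%, whereas the Trans-Omics for Precision Medicine shares leave about 2% of samples unassigned in the text.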
Often, large sample sizes are needed to 
uncover rare genetic variants associated 
with disease traits. In fact, this realization 
— from the first generation of exome dis- 
covery studies — is driving new interest in 
ultracheap genotyping arrays (collections 
of targeted fragments of DNA). Using such 
arrays, geneticists can speed up the sequenc- 
ing process and analyse many targeted sam- 
ples in one go. Exome sequencing combined 
with the use of genotyping arrays is likely to 
be the favoured approach over the next dec- 
ade. Nonetheless, GWAS remains a useful 
precursor to such studies, as well as to those 
involving whole-genome sequencing.

And emerging data indicate that inequalities
in health care are being exacerbated by
findings from whole-exome and -genome
sequencing, despite their greater sample
diversity compared with GWAS. Patients
of African and Asian ancestry are currently
more likely than those of European ancestry
to receive ambiguous genetic test results after
exome sequencing, or be told that they
have variants of unknown significance.
Furthermore, patients of African ancestry
are more likely than those of European ancestry
to be wrongly told that a mutation they
carry increases their risk of developing a
life-threatening heart condition known as
hypertrophic cardiomyopathy. Had more
ethnically diverse controls been included in
the candidate-gene studies that identified
these associations, population-specific differences
in the frequency of presumed disease-causing
variants would have revealed a
false positive at the outset.

“Historical, cultural, scientific and logistical factors are sustaining an embarrassing bias in genomics.”


WHAT NOW? 
The message being broadcast by the scientific
and medical genomics community to
the rest of the world is currently a harmful 
and misleading one: the genomes of Euro- 
pean descendants matter the most. 

Certain efforts, combined with newer 
data-gathering initiatives, can help to move 
the needle in the right direction. Some inves- 
tigators in genomics focus exclusively on 
diverse populations. For instance, landmark 
trans-ethnic studies have identified genes 
associated with traits such as diabetes, lev- 
els of lipids and other metabolites, prostate 
cancer and gene expression. Also, various
ventures aim to boost genomics studies in
under-represented populations worldwide.
The Human Heredity and Health in Africa 
Consortium, for example, was established by 
the NIH and the Wellcome Trust in London 
in 2012 to help build infrastructure and 
genomics expertise across Africa. 

In our view, more fundamental changes are 
needed — both top-down and bottom-up. 
Funding agencies should develop financial 
incentives for the creation of diverse cohorts 
of study participants. One way for them to do 
this would be to prioritize grant requests that 
propose investigations in populations of non- 
European (and especially of African) ances- 
try. Given limited budgets, this may need to 
happen hand in hand with a reduction in 
the funding of research on existing cohorts 
of European ancestry for traits and diseases 
that have been relatively well character- 
ized. (Around 850 genetic associations with 
height have now been reported by roughly
30 independent GWAS — the vast majority of
which have been conducted using individuals 
of European ancestry.) 

Further, all genomics researchers need to 
recognize the importance of studying under- 
represented populations to ensure that the 
benefits of research are distributed fairly and 
to maximize the potential for discovery. On 
a practical level, training programmes and 
new infrastructure, such as good health- 
care clinics that provide genetic testing in 
predominantly black or Hispanic neigh- 
bourhoods, could enhance trust and allow 
people to engage in projects as stakeholders 
rather than as study participants. 

A culture shift is required at every level. 
Efforts to recruit participants for biomedical 
research in under-represented communities 
have been most successful when conducted 
by investigators of concordant racial or 
ethnic background, and in partnership with 
institutions trusted by those communities —
such as historically black colleges
and universities in the United States. 

Indeed, to a large extent, the persistent bias
in sampling in genomics mirrors the employ- 
ment trends evident in biomedical institu- 
tions worldwide. In the United States in 2012, 
less than 4% of the tenured and tenure-track 
faculty members in research-intensive bio- 
medical departments were African American, 
Hispanic or Native American.

A complex web of historical, cultural, 
scientific and logistical factors is sustaining 
an embarrassing bias in genomics. Before 
precision medicine takes hold in clinical 
practice, we must correct its course.

Washington DC at night.


Scientists must have a say 
in the future of cities 


A United Nations conference seeks urban sustainability. But the agenda will fail 
without input from researchers, warn Timon McPhearson and colleagues. 


More urban areas will be built in the
next 30 years than ever before.
Growing settlements will increase
demand for infrastructure, food, energy,
water and housing. Simply meeting the 
projected urban expansion will breach the 
warming limit set by the 2015 Paris climate 
agreement. 

This week, the United Nations’ third
major global cities conference, Habitat III, 
convenes in Quito, Ecuador. Held every 
20 years, this multilateral meeting will 
adopt a global framework for making cities 
more sustainable — the New Urban Agenda 
(NUA). Sadly, science was largely absent 
from the drafting process of the NUA. By 
contrast, expert evidence guided the Paris 
climate deal, the 2015 Sendai Framework
for Disaster Risk Reduction and the UN's
2030 Agenda for Sustainable Development 
and its Sustainable Development Goals 
(SDGs). 

One reason is that the scientific commu- 
nity was unprepared for Habitat III. The few 
scientists invited to participate accepted a 
consultative role, nested among other 
public voices. Then, in late July, negotiators 
dropped the proposed multistakeholder 
panel, which would have formally embedded 
scientists and other non-state representatives 
in the implementation process. European
Union members and other rich countries 
were concerned that the panel would be 
expensive. The final draft of the NUA 
brokered in New York last month failed to 
reverse this. It is thus necessary to argue the
case once again for the importance of urban
science and of establishing a science-policy 
interface for the NUA. 

Urban research is disparate, marginalized 
and ill-prepared to interact effectively 
with global policy. The Habitat III agenda 
requires a global community of urban 
biophysical and social scientists to assess 
developments and help direct progress. To 
achieve the SDGs and the NUA, the global 
urban research community must come 
together to develop institutions, funding 
mechanisms and research agendas. 


URBAN ACCELERATION 
Rapid urbanization is one of the biggest 
social transformations in human history.
Cities are depleting resources and face new
risks caused by climate change. For example,
disastrous floods in the past decade in the
United States, the Philippines, the United Kingdom,
India and China show how vulnerable
coastal and riverside cities are to storm
surges, with trillions of dollars’ worth of
assets at stake.

Yet cities can also be engines of innova- 
tion. Here, the most progress is being made 
on climate change and other sustainability
goals. For example, cities around the world
are embracing nature-based infrastructure
for adaptation and resilience, such as green
roofs and wetland restoration.

City processes are complex and often
far from equilibrium, displaying emergent
properties and non-linear dynamics.
Urban areas are difficult to plan, manage
and govern, and have
a rapacious appetite for energy and
materials, with global
environmental impact. Urban challenges
ask complex and interrelated questions
about equity, justice, resilience, economic
opportunity, infrastructure development,
ecological restoration and more.

“Most research is in the north; most need is in the south.”


COME TOGETHER 
Implementing, monitoring, evaluating and 
revising the NUA and related SDGs will 
require evidence from across the research 
community, from natural and social scien- 
tists to humanities scholars. To be useful to 
policymakers, urban research needs to be 
organized, representative and seen as legiti- 
mate. This is far from the case. 

Urban researchers are scattered across 
non-governmental organizations, gov- 
ernment agencies and community-based 
organizations, and are found both in and 
outside academia. They span many disci- 
plines and professions, including architec- 
ture, ecology, engineering and geography. 
People, funds and institutions are distrib- 
uted unevenly. 

Most urban scientists and resources are
located in the global north and in large cit- 
ies, but the most pressing urban challenges 
tend to be found in the global south and in 
small to medium sized cities. Urban research 
and solutions are context-specific. The dif- 
ferent developmental trajectories of cities in 
Africa, Asia or Latin America may be at least 
as significant as the better-documented gap 
between northern and southern cities.

Scholars must expand primary research 
in less-studied and rapidly changing urban 
contexts such as those found in Latin 
America and south and southeast Asia. 
For example, too little is known about the 
global interlocking system of cities in terms 
of material usage, ecosystems, social and 
political norms, migration, disease vectors
and innovation. Urban scientists need to
better map and model these to provide
information for planning, management
and policymaking.

The skills of drawing together many 
sources of knowledge to inform global urban 
policy are in short supply. Professional certi- 
fication systems and the lack of interactions 
across sectors reinforce the isolation of 
specialists such as engineers, architects and 
planners. Paradoxically, urban researchers in 
the global south, who are forced to become 
generalists because of skill shortages, may 
have broader experience than their peers 
in the north, where academic practices and 
evaluations often reinforce specialization. 
Many southern scholars engage directly with 
urban communities and local and national 
policymakers. 

We view as inadequate one model that has 
been advocated during the Habitat III pro- 
cess for bringing together urban knowledge 
at the global scale — an urban equivalent 
of the Intergovernmental Panel on Climate 
Change (IPCC). Although the IPCC has 
been successful in focusing the efforts of the 
international climate-science community 
around specific policy-relevant questions, 
it has also proved slow and cumbersome. 
Urban science is broad and fast-moving. 
Even urban scientists do not necessarily 
agree on the most important research ques- 
tions, let alone the prescriptions. 


FIVE STEPS 
We advocate the following steps to boost the
development and impact of urban science. 


Form a global urban scientific body. An 
international urban science platform should 
be formed to address the post-2030 agenda. 
It must enable broad science-policy inter- 
action and cross-city learning at a global 
scale. This could connect existing global net- 
works such as the IPCC, Intergovernmental 
Science-Policy Platform on Biodiversity 
and Ecosystem Services, UN-Habitat, the 
UN Environment Programme, the Future 
Earth Urban Knowledge-Action Network 
and the Sustainable Development Solutions 
Network. The body should be developed in 
consultation with scientists, professional 
societies and holders of urban knowledge at 
all levels, including scholars, civil servants 
and citizens. Governance must be inclusive 
and could be based on the polycentric model 
developed at Future Earth, a global platform 
for sustainability research, with regional 
hubs responding to local issues. 


Spread knowledge and institutions 
globally. Most research is in the north; 
most need is in the 
south. Inclusivity 
and diversity across 
geographic regions
and scientific domains is key to legitimacy
and legibility. Major investment is needed 
in academic institutes that are sited at the 
nexus of urban research, policy and prac- 
tice in rapidly urbanizing cities. Mapping 
knowledge and institutions would help to 
uncover key geographic and thematic gaps. 


Boost funding for urban research. Truly 
global sources of research grants are needed 
to allow cross-comparison studies of cities 
and regions. These should be set up with 
support from national governments, devel- 
opment banks and private foundations. 
This would require large sums (one of the 
reasons that the multistakeholder panel was 
taken out of the final NUA draft). 


Support transdisciplinary research and 
synthesis. Communities with relevant 
knowledge must guide urban-development 
policy over the short and long term. Trans- 
disciplinary research must be supported 
through new sources of urban science fund- 
ing and organizations. Existing knowledge 
should be synthesized and fed into policy- 
making at all levels. 


Improve access to science-policy arenas. 
Urban scholars must have a clear role in 
the policy platforms that are emerging in 
the NUA and the wider multilateral system, 
such as the links forming between the urban
SDGs and the Future Earth Urban Knowl- 
edge-Action Network. 

It is imperative to scale up urban research 
and foster a scientific leadership to direct 
and critique global urban policymaking,

Enrico Fermi at the controls of the synchrocyclotron particle accelerator at the University of Chicago, Illinois, in the 1950s.


Fallible pontiff of physics 


Graham Farmelo assesses a biography of star theorist-experimentalist Enrico Fermi.

The Pope of Physics: Enrico Fermi and the Birth of the Atomic Age
Gino Segré and Bettina Hoerlin

The megalomaniac physicist Edward
Teller “was always certain that he was
smarter than all his colleagues on the
Manhattan Project, except one”, a younger
colleague of his (Robert Sachs) told me in
1993. That exception was Enrico Fermi, “the
nearest we physicists had to a Pope”.
Fermi’s infallibility is one of the prominent
themes of The Pope of Physics, the first popu-
lar cradle-to-grave biography in English of
the most famous Italian scientific investigator
since Galileo Galilei. The authors are married:
Bettina Hoerlin, formerly a health executive,
and Gino Segré, a nephew of one of Fermi’s
most distinguished colleagues, Emilio Segré.
They each met their subject independently,
and plainly found his career an inspiration.
Fermi’s many contributions to nuclear
science and technology (such as the crea-
tion of the first nuclear reactor) have been
recounted ad nauseam, so he might not seem
the most promising candidate for a biogra-
phy. Segré and Hoerlin, however, seek a new
perspective. They quickly hit their stride with
a lucid account of how Fermi was born in
1901 to a middle-class family in Rome and
became one of the very few physicists to be in
the front rank in both theory and experi-
ment. The authors engagingly describe
how Fermi taught himself basic mathematics
and physics to a high standard by the time
he was 17. Blessed with a prodigious ability to
identify the essence
of every physics problem, he matured as a
researcher in the 1920s, at just the right time
to make his mark on two open frontiers:
nuclear science and quantum theory.
Fermi was often quicker than more formal
thinkers to apply abstract ideas to improve the
understanding of nature. In 1926 he became
the first to use quantum theory to study large
aggregations of electrons, one of a class of
subatomic particles later called fermions,
and he later set out the first quantum field
theory of radioactive decay. Yet his forte
was experimental physics, as he demonstrated
at the Sapienza University of Rome, which
appointed him a professor at the age of 24.
He made the Italian capital one of the world’s
most productive centres of modern physics.

Fermi and his talented colleagues found 
that slow neutrons are remarkably effective 
at inducing radioactivity in some heavy 
chemical elements. That won him the phys- 
ics Nobel in 1938. However, when Fermi 
and his group studied the products of some 
of the nuclear reactions that they induced, 
their interpretations were sometimes wrong, 
as he later acknowledged. To be fair, the great 
radiochemist Otto Hahn and his group in 
Berlin made the same mistake at about the 
same time, before they recognized that 
uranium nuclei undergo the process later 
called fission.

The authors describe movingly how the
rise of Fascism in Italy led the Fermi fam- 
ily to emigrate to the United States. Fermi 
arrived in New York City on 2 January 1939, 
and soon took up a post at Columbia Univer- 
sity. Although some US officials were wary 
of putting foreigners in leading positions in 
secret projects, Fermi was indisputably the 
best person to lead the development and con- 
struction of the first nuclear reactor, CP-1, 
which began operation on 2 December 1942,
a crucial stage in the development of nuclear
weapons. Robert Oppenheimer invited him 
to join the Manhattan Project and created a 
unit named after him. ‘F division’ dealt with 
especially tough problems in experimental 
and theoretical nuclear physics. 

The biography sometimes has a hagio-
graphic whiff. Segré and Hoerlin agree with
the consensus that he was largely apoliti-
cal and was “a scientist pure and simple”, as
his Hungarian colleague Leo Szilard once 
described him. I suspect that the truth is 
deeper, although it will probably remain 
hidden. The authors tiptoe around some of 
his less endearing characteristics, such as 
his ability as a father. Fermi’s daughter told
me in 1992 that he was “always a distant fig-
ure”, and that 40 years earlier her father had
approached her out of the blue one day, sug-
gesting that they get to know each other bet-
ter by doing a few experiments around the
house on the then-new material Silly Putty. 
He did not repeat this initiative, which did 
little to improve their relationship. 

In my view, Segré and Hoerlin underplay 
Fermi’s considerable influence on young 
physicists after the Second World War. 
It would have been revealing to read, for 
example, more from the great theoreticians 
Murray Gell-Mann and Chen-Ning Yang, 
who worked closely with him. Likewise, it 
would have been rewarding to have heard 
more about Fermi’s ideas on the origin of 
cosmic rays and his thinking about the 
future of subatomic-particle accelerators. 
In January 1954, he gave a far-sighted lec- 
ture in which he envisaged the possibility 
of building an ultrahigh-energy particle accelerator
that girdled the entire planet by 1994. 

Ten months later, he was dead. Lying in
his hospital bed the day after he had learnt
that he had terminal stomach cancer, he told
the astrophysicist Subrahmanyan Chan-
drasekhar: “For a man past 50, nothing
essentially new can happen.” Had he lived
two decades longer, that would not have 
been true. Physicists were soon to make 
discoveries that fundamentally altered 
their understanding of nature — another 
reminder that Fermi’s foresight was, like 
that of all putative scientific popes, fallible.

Books in brief


Welcome to the Universe: An Astrophysical Tour 

Neil deGrasse Tyson, Michael A. Strauss and J. Richard Gott 
PRINCETON UNIVERSITY PRESS (2016) 

As citizens of the cosmos, we are duty bound to explore it. So opine 
astrophysicists Neil deGrasse Tyson, Michael Strauss and Richard 
Gott, guides on this bracing expedition through dusty galactic 
hinterlands and the vast theoretical vistas of Albert Einstein's work. 
Each is a master at untangling the abstruse through metaphor: Tyson 
crams 100 million elephants into a thimble to illustrate neutron-star 
density, and Gott recounts John Archibald Wheeler demonstrating 
entropy by mixing tea and water and throwing it into a ‘black hole’. 


The Attention Merchants: The Epic Scramble to Get Inside Our Heads 
Tim Wu KNOPF (2016)

Media scholar Tim Wu plunges into the noisome history of 
“attention harvesting” — the commodification of human attention 
by industry and government. It began, Wu reveals, with the 
juxtaposition of advertisements and lurid news in 1830s gutter 
journalism, and persisted in the engineered demands of “scientific 
advertising”, the efforts of propagandist Edward Bernays (who
persuaded women to smoke) and the infiltration of fast-food ads 
into US schools. To evade this induced narcosis and reclaim lived 
experience, Wu argues, we must wean ourselves off the digital 


The Great Derangement: Climate Change and the Unthinkable 
Amitav Ghosh UNIVERSITY OF CHICAGO PRESS (2016) 

Resistance to the grim realities of climate change is so widespread 
that the crisis barely figures in literary fiction, notes writer
Amitav Ghosh. Branding our era of denial and inertia the Great
Derangement, Ghosh looks in turn at literature, history and politics 
to examine this failure, noting that extreme events such as 2012's 
Hurricane Sandy are so freakish that they seem inexpressible. The 
solution, he argues, lies in collective action as well as scientific and 
governmental involvement, and in a resurgence in our imaginative
capacity to envision human existence anew. 


Cat Wars: The Devastating Consequences of a Cuddly Killer 
Peter P. Marra and Chris Santella PRINCETON UNIVERSITY PRESS (2016) 
Among the hundreds of millions of domestic cats, many range 
freely. That group is effectively a death squad for songbirds, 
killing an estimated 4 billion US avifauna a year; globally, island 
cats drive 14% of vertebrate extinctions. This deeply researched 
overview by conservation scientist Peter Marra and writer Chris 
Santella interlaces discussions of feline domestication and avian 
conservation with the science of decline and of feline spillover 
diseases. It culminates with a stark choice: control free-ranging cats 
or witness the ongoing erosion of affected ecosystems. 


Assassin of Youth: A Kaleidoscopic History of Harry J. Anslinger’s 
War on Drugs 

Alexandra Chasin UNIVERSITY OF CHICAGO PRESS (2016) 

Harry Anslinger helmed the US Federal Bureau of Narcotics from 
1930 to 1962, shaping US drug policy through what Alexandra 
Chasin calls “an elaborately disastrous set of policies and laws”. In 
this idiosyncratic chronicle, Chasin paces the trail from temperance 
to today, when nearly half the inmates of US jails are incarcerated 
for drug offences. A sorry tale of how one man's racial prejudice and 
predilection for prohibition led to a colossal policy failure. Barbara Kiser

Einstein the statesman


Nancy Thorndike Greenspan enjoys a study of the physicist as engaged public figure. 


Albert Einstein's dreams have been […],
his love life scrutinized,
his letters parsed. His brain has been
dissected and driven 5,000 kilometres across
the United States. In a way, he has survived
it all.

In Einstein and Twentieth-Century Poli-
tics, it is the physicist’s politics and princi-
ples that are once again debated. Einstein is
the global thinker, a scientist with important 
social insights and a charismatic personal- 
ity besides. Richard Crockatt shows that 
the causes that he championed embody the 
world’s conflicts of the time: pacifism; zion- 
ism and Israel; the atomic bomb and the 
arms race; and world government. Writes 
Crockatt (a British scholar of US foreign 
policy and mid-twentieth-century interna- 
tional relations): “In looking at his life and 
ideas one sees over his shoulder into the 
world he lived in, not merely into his mind.”

Einstein’s first public foray into politics was
in 1914. He was bold from the start: aged 35
and teaching at the University of Berlin, he
was one of only four signatories to a manifesto
against the First World War. It was to counter
another signed by 93 of his peers, defending
Germany's ‘just war’. He didn’t state so openly,
but Einstein hoped that Imperial Germany
would lose the war and rise from the ashes as
a democratic socialist government. 

Before the war, he was little-known 
outside physics. Celebrity enveloped him 
in 1919, when experimentalist Arthur 
Eddington confirmed the general theory 
of relativity. After this, Einstein — who 
was Jewish — was no longer a mere mortal, 
a fact that roused an increasingly anti- 
Semitic Germany. Anti-relativists with an 
anti-Jewish agenda attacked him. In this and 
only this instance, he lashed out in a news- 
paper article, which brought more attacks. 
He learned that detractors could agitate him, 
that he needed to protect his privacy and 
that entering the political fray disrupted his 
science. From then on, he made pronounce-
ments on political and moral issues that
aimed to keep him above ordinary politics. 
These dimensions are all nimbly woven into 
Einstein and Twentieth-Century Politics. 

After the First World War, the League 
of Nations was founded to advance peace. 
Einstein had an ambivalent relationship with 
it, as did many of his liberal contemporar- 
ies. They feared that the military strength of 


Einstein speaking on science and 


for the United States in the face of the Nazi 
assault, Einstein refined his views. A grudg- 
ing pragmatism took hold: what good would 
come from supporting peace if freedom were 
lost? Allied military strength became a ben- 
efit. Such was his evolution on many issues, 
garnering criticism in private from his left- 
ist intellectual friends and in public from his 
foes, especially those in the US government. 
He was characterized 
variously as reckless, 
hypocritical, naive and 
pro-Soviet. 

Einstein and Twentieth-Century Politics

One decision, promoting the building of the atomic bomb, he regretted until he
died. He wrote three letters to US President Franklin Roosevelt: the
first two advocated for the bomb; the last indirectly warned against


ation in 1933. 


—and that he never worked on the bomb — 
but he could not break the public’s perception
that he was responsible for it. In the end, all 
he could do was to press more urgently for 
a supranational government that controlled 
military power, especially the bomb.

Crockatt skilfully uses letters and articles 
from “the liberal wing of international opin-
ion”, non-scientists all, to limn the subtleties
of the issues. Philosophers Bertrand Russell 
and Albert Schweitzer; writers George Ber- 
nard Shaw, Thomas Mann and H. G. Wells; 
and Mahatma Gandhi (with a cameo by
Sigmund Freud) all draw out the nuances
in Einstein's positions and expose the com-
plexities created by a rapidly changing world.
They agreed, more or less, on the objective:
a world government that limits the power
of the nation-state and nullifies war. Details,
such as the role of the state, create the debate.

Crockatt's deft pen and thoughtful 
approach form an engaging and reveal- 
ing image of Einstein the non-scientist. He 
marshals a wealth of material into a convinc-
ing narrative. However, to tease out Ein-
stein’s “distinctive intellectual and emotional
qualities”, Crockatt may have overreached. In
essence, he alleges that Einstein examined 
every angle of a scientific idea but “rarely, 
if ever, questioned” his political principles 
because they were self-evident. 

I doubt it was that black and white. 
Crockatt himself describes numerous inci- 
dents in which Einstein altered his political 
positions fundamentally. Crockatt suggests 
that the shifts were tactical, but they must 
have required deep reflection and question- 
ing. These were profoundly consequential 
decisions. Nor was Einstein entirely the 
dispassionate, empirical scientist of his own 
characterization or that Crockatt assumes. 
Einstein's argument for rejecting quantum 
mechanics, that God “does not play dice”,
was more about intuition than well-founded
scientific principle. 

Nevertheless, Einstein and Twentieth- 
Century Politics delivers what Crockatt 
promises: a picture of Einstein's world and
mind, conveyed with an insightful brush. 
And these issues political and philosophical 
are sufficiently complex that they will surely 
live to see many other interpretations. 


Nancy Thorndike Greenspan is a writer 
based in Washington DC. Her most recent 
book is The End of the Certain World: The 
Life and Science of Max Born.

e-mail: nancy@nancygreenspan.com

Don’t let witch hunts
taint investigations 
Donald Kornfeld and Sandra 
Titus argue that misconduct 
should be considered when 
investigating irreproducible 
research (Nature 537, 29-30; 
2016). In my view, this premise 
of ‘guilty until proved innocent’ 
risks turning a scholarly 
investigation into a witch hunt. 

Distinguishing poorly 
designed research from faked 
data is hard, but it is generally
more difficult to prove 
misconduct than to identify 
the cause of irreproducibility. 
Moreover, investigating 
bad science costs less than 
examining misconduct (in 
terms of money, time, careers 
and so on). One estimate put
the direct cost of a misconduct
enquiry at US$525,000
(A.M. Michalek et al. PLoS Med.
7, e1000318; 2010).

The authors assert that the US
National Institutes of Health's 
(NIH) training mandate for 
responsible conduct of research 
failed to reduce misconduct. 
Perusing research on the rate 
of misconduct, I find estimates 
that span several orders of 
magnitude, so it is unclear 
whether misconduct is or has 
been rising or falling. Thus, we 
cannot say what effect the NIH 
training scheme has had. 

The damage caused to the
scientific record by publishing 
sloppy, plagiarized or fabricated 
research is ultimately the same. 
Kenneth Pimple Indiana 
University, Bloomington, USA. 
pimple@indiana.edu 


University on the rise 
without PhD students 


California State University in
Northridge (CSUN) is ranked
24th in the latest Nature Index
of the 25 North American
institutions classed as ‘Rising
stars’ (see go.nature.com/2dfvirb).
Of these, CSUN has the
greatest percentage increase
in publications in 2012-15 in
high-impact journals (up by
190.61%). As professor emeritus
at CSUN, my view is that the rise
is attributable to the university's
unusual research model.

The university does not 
offer any PhD degrees. Instead, 
science undergraduates and 
master’s students work alongside 
exceptional faculty members, 
who train and mentor them in 
research. The faculty includes 
28 prestigious members hired 
over the past decade. The dean 
of the science college, Jerry 
Stinner, backed by the university 
chairs, president and provost, 
organized CSUN funding to 
recruit and support them. This 
support included comprehensive 
packages of research equipment 
and supplies, specific laboratory 
renovations and a reduced 
teaching load while they set up 
their labs.
Steve Oppenheimer California 
State University, Northridge, 
California, USA. 
steven.oppenheimer@csun.edu 


Hasten end of dated 
fossil-fuel subsidies 


Nations at last month's G20 
summit in China reaffirmed 
their 2009 commitment to 
phase out fossil-fuel subsidies, 
echoing a call from almost two 
decades ago to end subsidies 
that are “adverse in the long run 
to both the economy and the 
environment” (N. Myers Nature 
392, 327-328; 1998). 

Similar ‘perverse’ subsidies 
continue to encourage logging of 
the few remaining pockets of old- 
growth forest in western Canada 
and overfishing in the high 
seas. Yet the fossil-fuel industry
receives the largest subsidy of all,
estimated by the International
Monetary Fund (IMF) last year
at US$1,000 annually for every
citizen in the G20 group. Most of 
this is provided by countries with 
energy taxes that are too low to 
cover the adverse effects of fossil- 
fuel consumption on human 
health and the environment 
(go.nature.com/2dbs2zf). 


The IMF also estimates that
eliminating fossil-fuel subsidies
would cut global carbon dioxide
emissions by more than 20%
and raise government revenue
by $2.9 trillion (or 3.6% of
global gross domestic product).
Such a step would save up to 
$93 per tonne of greenhouse- 
gas emissions removed (see 
go.nature.com/2dowew). 

These sums alone would
fund climate adaptation and the 
protection of imperilled global 
biodiversity for the next 30 years 
(D. P. McCarthy et al. Science 
338, 946-949; 2012). The money 
would also boost development 
of renewable energy sources and 
domestic support for a low- 
carbon economy. 

Tara Martin University of British
Columbia, Vancouver, Canada.
tara.martin@ubc.ca


Are farmed fish just 
for the wealthy? 


Christopher Golden and 
colleagues argue that farmed 
fish contribute little to global 
food security because they are 
“mostly exported to the wealthy 
countries of Europe and North 
America” (Nature 534, 317-320; 
2016). In fact, more than 90% of
farmed fish produced in China,
India, Indonesia, Bangladesh,
Egypt, the Philippines and
Myanmar (some of the world’s
largest aquaculture-producing
developing countries) remains
in domestic markets (see
go.nature.com/2dqwwh). 

Aquaculture products are 
more accessible to the poor in 
many developing nations than 
ever before (K. A. Toufique
and B. Belton World Dev.
64, 609-620; 2014). And the
aquaculture boom of the past
two decades has stabilized world
fish prices (S. Tveteras et al. PLoS
ONE 7, e36731; 2012).

The realities of the supply and 
demand of aquaculture products 
mean that these now complement
capture fisheries for global food 
and nutrition security. 

Ben Belton Michigan State
University, East Lansing, USA.
Simon R. Bush Wageningen 
University, the Netherlands. 
David C. Little University of 
Stirling, UK. 
beltonbe@msu.edu 


Christopher Golden et al. 
reply — Our argument is 
that most farmed fish are 
not reaching nutritionally 
vulnerable people in the 
low-income, food-deficit 
countries of sub-Saharan 
Africa and the Pacific islands 
(Nature 534, 317-320; 2016). 
In those nations, fish is a 
traditional food source that 
comes primarily from capture 
fisheries, including subsistence 
harvests (M.M. Dey et al. Mar. 
Policy 67, 156-163; 2016). 
Domestic consumption and 
import of aquaculture products 
are still relatively insignificant 
(see go.nature.com/2dinzuc). 
In such places, aquaculture
policy interventions need to 
be optimized for nutritional 
value and distribution to 
food-insecure populations.
This could be achieved through 
appropriate regulations and 
market instruments (such as 
tax incentives or subsidies) and 
public-health campaigns, in 
close alliance with conservation 
strategies for sustainable 
fisheries. 
Harvard T. H. Chan School 
of Public Health, Cambridge, 
Massachusetts, USA. 
golden@hsph.harvard.edu 


CORRECTIONS 
The Outlook article ‘Industrial 
strength’ (Nature 537, S57-
S59; 2016) incorrectly stated
that the 1999 trial at the 
University of Pennsylvania was 
based on a retrovirus; it was in 
fact based on an adenovirus. 
Also, the Outlook Q&A 
‘Illuminated Universe’ (Nature
537, S205; 2016) incorrectly 
gave the amount of dark 
energy in a cubic metre of 
space as 10-27 kilograms 
instead of 10° kilograms. 


NATURE | 171 


OBITUARY 


Roger Yonchien Tsien 


(1952-2016) 


Creator of a rainbow of fluorescent probes that lit up biology. 


Roger Yonchien Tsien pioneered
the use of light and colour to ‘peek
and poke’ at living cells to see how 
they work. His most famous achievement, 
recognized by a share of the Nobel Prize in 
Chemistry in 2008, transformed biology: he 
developed a rainbow of probes, based on the 
jellyfish green fluorescent protein (GFP), to 
illuminate cell structure and function. 

Roger died suddenly in a park near his 
home in Oregon on 24 August. He was born 
in New York in 1952 with science in his 
blood. His father’s cousin was Tsien Hsue- 
shen (Qian Xuesen), architect of China's 
missile and space programme. Roger would
combine his father’s engineering talent with 
the medical interests of his mother, a nurse. 

Roger had an early passion for chemistry. 
Despite his Chinese name (which means 
‘always healthy’), childhood asthma often 
kept him indoors, reading and drawing. 
He fought going to kindergarten until his 
teacher allowed him to bring in a favourite
book: he picked All about the Wonders of 
Chemistry. From the age of eight, he per- 
formed increasingly complex and some- 
times hazardous chemistry experiments at 
home. At 16, he went to Harvard University 
in Cambridge, Massachusetts (avoiding 
the Massachusetts Institute of Technol- 
ogy, where his father, uncles and broth- 
ers studied), and sampled many subjects. 
Ironically he found the chemistry courses 
“so distasteful” that he abandoned them for 
neurobiology. 

Roger then spent nine years at the 
Physiological Laboratory at the University of 
Cambridge, UK. First he was a PhD student 
with the eminent muscle physiologist Rich- 
ard Adrian; then he did a postdoc with one 
of us (T.R.). He emerged as an ingenious, 
largely self-taught synthetic chemist. 

Much of Roger's early work was directed
at imaging neural activity, by trying to 
develop tracers of sodium- or calcium-ion 
movements that support brain signalling. 
By 1980, he had invented quin2, a synthetic 
fluorescent dye that selectively binds to 
calcium, and had devised a clever way to 
sneak this dye and other probes into intact 
cells. This first practical probe for calcium
found wide early use in studies of intracel- 
lular calcium signalling. 

Amazingly, Roger struggled to find a
faculty position because his work straddled
disciplines. In 1982, he joined the physiology
department at the University of California,
Berkeley, where colleagues encouraged him
to create more tools. First came superior 
calcium dyes, in particular fura2, which is 
strongly excited by different wavelengths 
of ultraviolet light before and after binding 
calcium. Capitalizing on this feature of fura2
(and indicators with similar optical proper-
ties), Roger and his group made it much
easier to monitor calcium under challenging 
conditions, for example, across the width of a 
cell. His group also created valuable fluores- 
cent sensors for pH and for sodium. 

In 1989, facing resource constraints, Roger 
transferred to the University of California, 
San Diego (UCSD). Here he remained for 
the rest of his career. He wanted to make 
sensors that could be genetically encoded, 
allowing researchers to target specific cell 
types without having to inject a tracer. In the 
1990s, he saw the potential of GFP. The pro-
tein had been isolated from jellyfish in the
1960s by Osamu Shimomura (who shared
the 2008 Nobel) and cloned by Douglas
Prasher in 1992. Martin Chalfie, who also
shared in the Nobel, first used GFP to image
living cells in 1994.

Roger's lab pioneered the development 
of GFP variants. Through a combination of
rational design and random mutagenesis,
they created dozens of bright fluorescent
proteins of various colours based on GFP.
Roger later produced longer-wavelength
sensors based on red fluorescent proteins.
He took great pleasure in naming probes
after fruits such as the tomato, cherry and 
plum. 

GFP variants are now ubiquitous in bio-
logical research. They can be used to bind
with and track cancer cells, aid gene therapy, 
image mitosis, paint neurons in rainbow 
colours and spy on signalling in subcellular 
organelles such as mitochondria. They have 
even been used to make art. 

Roger's group at UCSD developed 
many other optical probes, including fast- 
response sensors to measure electrical 
signals across cell membranes, and dyes 
for tracking proteins with a combina- 
tion of light and electron microscopy. In 
recent years, he had two main projects: the 
design of fluorescent tracers to illuminate 
tumours during cancer surgery; and the 
storage of long-term memory by the pat- 
tern of holes in the perineuronal net that 
surrounds neurons in the brain. 

Roger's trajectory helped to make it 
respectable, indeed fashionable, to spend a 
career inventing reagents and methods. He 
is named in more than 160 US patents, often 
as lead inventor. Although naturally keen to
participate in the first application of his new 
tools, he was also generous in providing 
materials to other scientists.

Roger co-founded three biotech compa- 
nies that capitalized on his inventions. He 
semi-seriously quipped to his wife Wendy 
that, apart from the potential human benefit, 
the main point of these companies was to 
provide suitable jobs for his postdocs. 

Roger was a fine pianist and briefly 
considered a musical career. A gifted ama- 
teur photographer — a hobby in keeping 
with his passion for colour and imaging — 
he enjoyed holidays in the wild outdoors, 
often taking arduous treks, camera in hand. 

Roger will be hugely missed by family, 
friends, colleagues and the many scientists 
who appreciated him as a brilliant enabler
of scientific progress.


Timothy J. Rink is an independent 
biotechnology consultant. He worked with 
Roger Tsien at Cambridge. Louis Y. Tsien is 
Roger’s elder brother and lives in Watertown,
Massachusetts. Richard W. Tsien is Roger’s
eldest brother and a neuroscientist at New
York University, New York, USA.

e-mails: rinktj@gmail.com; 
louis.tsien@gmail.com; 
richard.tsien@nyumc.org

Megafloods downsized


A fresh look at the Channeled Scablands of North America shows that the ancient floods that scarred that landscape were 
smaller than is commonly assumed. This result could revise estimates of similar floods on Mars. SEE LETTER P.229 


J. TAYLOR PERRON & JEREMY G. VENDITTI 


The enormous canyons of the Channeled
Scablands in the northwestern United
States, many of which contain no rivers,
puzzled geologists for decades. The gradual 
realization that these canyons were carved 
thousands of years ago by huge floods spawned 
by melting glaciers challenged the idea that 
Earth’s surface is shaped by gradual, steady 
erosion. However, on page 229, Larsen and 
Lamb¹ show that at least one of the canyons was
formed by a succession of much smaller floods, 
a finding that has implications for flood-carved 
canyons on Mars. 

When the geologist J Harlen Bretz proposed 
in the 1920s that the Channeled Scablands 
were created by a catastrophic flood, his
ideas were attacked relentlessly by geologists 
who subscribed to the mainstream view that 
erosion is slow and steady, and who wanted 
to distance their profession from the notion 
of a biblical deluge. Bretz did not identify the
source of the flooding until the 1940s, when 
his colleague Joseph Pardee found evidence
that ancient Lake Missoula, which formed 
at the margin of the melting Cordilleran ice 
sheet roughly 15,000 years ago, had drained 
catastrophically to the west. This discovery 
led to the gradual acceptance of Bretz’s flood 
hypothesis, which was later supported by 
studies that considered the mechanics of large 
flows through canyons. Subsequent analyses
of sediments deposited throughout the region 
showed that the Channeled Scablands had 
experienced not one but many floods.

Although the flood origin of the Channeled
Scablands is no longer disputed, the sizes of 
the individual floods remain uncertain. It has 
become common practice to place an upper 
bound on the flow rate of the floods by assum- 
ing that they filled the present-day canyons to 
the brim. Estimated flood magnitudes based 
on this assumption range up to 60 cubic
kilometres per hour — nearly 100 times the 
average flow rate of the Amazon River today.
But these estimates might be much too large. 
Glaciologists have argued that it is difficult for
ice sheets to store enough water to produce 
such enormous floods. The brimful-flood
model also requires the unlikely scenario that 
each flood passing through the canyons was
larger than the one that preceded it, because
the canyon deepens as each successive flood
erodes the bedrock (Fig. 1a).

Figure 1 | Competing models of canyon erosion by floods. a, Erosion by brimful floods. It is commonly
assumed that canyons form in accordance with a brimful model, which requires progressively larger and
deeper floods as the canyon erodes. b, Erosion by smaller floods. Larsen and Lamb use remnants of former
canyon floors to show that Moses Coulee was instead shaped by a sequence of smaller floods. (Panel
labels: first flood, later flood, final flood, former canyon floor.)

Larsen and Lamb wade into this debate and 
present evidence that a series of consistently 
sized, moderate floods eroded the canyons of 
the Channeled Scablands. In this scenario, the 
first flood filled the shallow, newly formed can- 
yons to the brim, but subsequent floods only 
partly filled the deepening canyons (Fig. 1b). 
They studied Moses Coulee (Fig. 2), a canyon 
in which a series of bench-shaped terraces 
preserves the remnants of former canyon 
floors that were abandoned by the flood water 
as the canyon was progressively eroded. 

Using previous estimates of the forces 
required to erode blocks of rock from the 
canyon floor, and a computational model 
of flood flow through the canyon, the 
authors constrained the minimum flow rate
corresponding to each remnant canyon floor. 
Their calculated flow rates are consistent with 
the presence of gravel bars that the most recent 
floods deposited in the canyon. Brimful floods 
would have instead suspended the gravel (and 
even larger boulders) high in the flow, prevent-
ing deposition. Larsen and Lamb conclude that
Moses Coulee was eroded by repeated floods 
of no more than 2 km³ h⁻¹. This flow rate is by
no means small (it is more than three times
that of the Amazon River), but it is much
smaller than the maximum of 10 km³ h⁻¹ that
is implied by the brimful model for Moses
Coulee.
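
To put the two Moses Coulee figures in more familiar units, the conversion can be sketched as follows. This is only an illustrative calculation based on the 2 and 10 km³ h⁻¹ values quoted above; the helper name is hypothetical and is not part of Larsen and Lamb's analysis.

```python
# Illustrative unit conversion for the flow rates quoted in the text.
# The helper name is an assumption made for this sketch.

M3_PER_KM3 = 1_000_000_000  # 1 km^3 = 1e9 m^3
SECONDS_PER_HOUR = 3_600


def km3_per_hour_to_m3_per_second(flow_km3_h: float) -> float:
    """Convert a flow rate from km^3 per hour to m^3 per second."""
    return flow_km3_h * M3_PER_KM3 / SECONDS_PER_HOUR


moderate_flood = 2.0   # km^3/h, terrace-based upper estimate (from the text)
brimful_flood = 10.0   # km^3/h, brimful-model maximum (from the text)

moderate_m3_s = km3_per_hour_to_m3_per_second(moderate_flood)
brimful_m3_s = km3_per_hour_to_m3_per_second(brimful_flood)
ratio = brimful_flood / moderate_flood

print(f"Moderate flood: {moderate_m3_s:,.0f} m^3/s")
print(f"Brimful flood:  {brimful_m3_s:,.0f} m^3/s")
print(f"Brimful / moderate: {ratio:.0f}x")
```

In these units the smaller estimate is still more than half a million cubic metres per second, one-fifth of the brimful figure.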

Floods as large as those discussed by Larsen
and Lamb have not been observed in recorded
history. This makes it difficult to test some of
the authors’ assumptions, such as the estimated
forces required to erode blocks of rock, and the
notion that the floods were just large enough to
erode their beds. It will also be challenging to
confirm that similarly modest floods formed
other Channeled Scabland canyons, because
not all canyons contain features that record the
progress of canyon incision in the same way as
the well-preserved terraces in Moses Coulee.
However, observations of erosion by smaller,
modern floods support the principles behind
the authors’ approach.

Figure 2 | Moses Coulee. Upstream view along the east wall of Moses Coulee, a canyon in the Channeled
Scablands of Washington state. (Photo: John Latta.)

Larsen and Lamb’s results raise the possibility
that the largest known floods in the Solar
System were smaller than previously esti-
mated. Numerous floods crossed the surface
of Mars during the past few billion years, 
carving enormous canyons that dwarf the 
Channeled Scablands. The source of the flood 
water remains a mystery, but each flood prob-
ably originated either when water erupted
from an underground aquifer, or when a sur- 
face reservoir, perhaps created by melting ice, 
suddenly drained — a scenario similar to that 
of Lake Missoula. Brimful flow rates estimated 
from high-water marks in the biggest Martian 
canyons are tens of times greater than the larg-
est estimates for the Channeled Scablands.
The immensity of these floods is even more
shocking given the cold, dry conditions that 
have characterized the surface of Mars for at 
least the past 2 billion years. 

Larsen and Lamb do not attempt to model 
the Martian floods, but their results support 
previous suggestions that the canyons on
Mars could have been carved by a succession 
of smaller floods. Such a scenario could help 
to resolve the discrepancy between flow rates
estimated from canyon topography and geo-
logical constraints on water supply rates.
A succession of floods would have required 
repeated replenishment of the water source, 
which has implications for Mars's ancient 
climate. A detailed study of water flow through
Martian canyons that is based on erosional
and fluid-dynamical constraints would help 
to clarify the magnitude of their catastrophic 
origins. The case of the Channeled Scablands 
reminds us that Earth still has many lessons to 
teach us about alien worlds.


J. Taylor Perron is in the Department of 
Earth, Atmospheric and Planetary Sciences, 
Massachusetts Institute of Technology, 
Cambridge, Massachusetts 02139, USA. 
Jeremy G. Venditti is in the Department of 
Geography, Simon Fraser University, Burnaby, 
British Columbia V5A 1S6, Canada.
Measuring our
narrow strip of life 


In line with previous research, a demographic analysis corroborates the presence
of a limit to human lifespan, indicating that increases in life expectancy are likely
to slow down or stop over the coming years. SEE LETTER P.257 


S. JAY OLSHANSKY 


The British author Annie Besant once
wrote: “out of the darkness of the
womb, into the darkness of the grave,
man passes across his narrow strip of life”.
The ration of time allocated to humans is of
profound personal and scientific interest. On
page 257, Dong et al. turn to the demographic
literature to analyse whether there is a limit to 
human lifespan — and find evidence to suggest 
that there is. 

Before discussing the study at hand, we 
should define some relevant terms. Lifespan 
describes how long an individual lives. Life 
expectancy is a population-based estimate 
of expected duration of life for individuals at
any age, based on a statistical ‘life table’. And
maximum lifespan is the age reached by the 
longest-lived member of a species. 

Human life expectancy has risen fairly 
steadily and rapidly over the past 150 years in
most countries. In 1990, colleagues and I pre-
dicted that this increase would slow over time,
and this has proved to be the case. Maximum
lifespan also seems to have risen steadily, but
this too might have reached an upper asymp- 
tote — no one is known to have lived longer 
than Jeanne Calment, who died in 1997 at the 
age of 122. Thus, the debate about life's limits 
is ongoing. 

Some scientists speculate that fixed limits to
life are unlikely to exist, because they cannot
be observed using the tools of mathematical
demography. Others suggest that unknown
technological advances in the future will con-
tinue to drive down death rates, leading to
accelerated gains in life expectancy and maxi-
mum lifespan. And yet others argue that there
is a limit to lifespan.

Figure 1 | A limit to lifespan? At 116, Emma Morano is the oldest known person alive today. Dong et al.
provide evidence that we are approaching the natural limit to human lifespan.

Dong and colleagues used demographic data
to investigate whether there is a limit to human
lifespan and, by implication, life expectancy. 
They first hypothesized that, if a biological 
limit does not exist (or is currently unobserv- 
able), the age group experiencing the greatest 
increase in survival should shift to ever-older 
groups over time. This hypothesis makes 
perfect sense, and the authors discovered 
that, in most countries that have reliable data, 
the greatest improvement in survival in the 
oldest age groups peaked in about 1980 and 
has not shifted since. 

Next, the researchers investigated whether 
increases in maximum lifespan had been 
observed in recent decades. They discov-
ered that, since the death of Calment, maxi-
mum lifespan for humans has regressed. This
occurred in spite of the increasing size of age- 
ing populations worldwide, which, in itself, 
should have led to an increase in maximum 
lifespan. Dong et al. conclude that these two 
observations represent compelling evidence 
that human lifespan has a ‘natural limit’ (Fig. 1).

Scientists who study ageing know that there 
is considerable variation in the duration of life 
across species, but within species there are
fixed attributes associated with life history — 
and longevity determination is one of them. 
Under protected living conditions in which 
predation is largely removed, mice tend to live 
about 1,000 days, dogs about 5,000 days and
humans about 29,000 days. Clearly, there are
biological reasons for each species’ average 
lifespan, so why would anyone think that peo- 
ple could live for much longer than we do now? 
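
For readers who think in years rather than days, the three figures above can be converted with a short calculation. This is a minimal sketch: the 365.25-day year and the variable names are assumptions made for illustration, and the day counts are simply those quoted in the text.

```python
# Convert the typical lifespans quoted in the text from days to years.
# DAYS_PER_YEAR is an assumption for this sketch (mean Julian year).

DAYS_PER_YEAR = 365.25

typical_lifespan_days = {
    "mouse": 1_000,   # from the text
    "dog": 5_000,     # from the text
    "human": 29_000,  # from the text
}


def days_to_years(days: float) -> float:
    """Convert a duration in days to years."""
    return days / DAYS_PER_YEAR


lifespan_years = {
    species: round(days_to_years(days), 1)
    for species, days in typical_lifespan_days.items()
}

for species, years in lifespan_years.items():
    print(f"{species}: about {years} years")
```

On these numbers the three species live for roughly 3, 14 and 79 years, respectively.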

The answer lies in the historical context 
within which human longevity has changed. 
The 30-year rise in life expectancy at birth
seen during the past century has nothing to
do with a modified rate of ageing. Instead,
it reflects improvements in public health that 
have drastically reduced early-age mortality, 
allowing most people in developed nations to 
reach old age for the first time in history. Death 
now clusters in people between the ages of 65 
and 95 (ref. 11). But, without further biomedi-
cal breakthroughs, life expectancy cannot con-
tinue to rise by much, and so future longevity
gains will diminish. The crucial question is 
how much more survival time can be gained 
through medical technology. With fixed 
life-history traits, it would seem that we are 
running up against a formidable barrier.

As the authors rightly point out, the idea of 
a ‘natural limit’ to life does not imply that such 
a limit is a direct by-product of some geneti-
cally driven program that causes both ageing
and death. Fixed genetic programs that directly
cause ageing and death cannot exist as a direct
product of evolution, because the end result 
would be death at an age beyond which almost 
every member of a species would ordinarily 
live. A genetic time bomb designed to kill us at 
older ages is equivalent to automobile manu- 
facturers building in an explosive device that 
is set off only when a car reaches one million 
miles. Because most cars are never driven that 
far, such a device would be useless. 

How is it possible to have a biological limit to
life, yet no genetic program that runs it? There
are biological clocks that measure time from 
conception and birth, but these metronomes are 
there to transform a fertilized egg into an adult 
capable of reproducing. These fixed genetic 
programs for growth, development, matura-
tion and reproduction (collectively known as a
life-history strategy) are products of more than 
3.7 billion years of evolution. Biological metro- 
nomes do not measure the time to age or die: 
instead, ageing is an inadvertent by-product of 
these clocks, which are designed to sustain life. 

This distinction is important: it means
that there is no fixed limit beyond which 
humans cannot live, but that there are, nev- 
ertheless, limits on the duration of life that 
are imposed by other genetically determined 
life-history traits. Think of constraints on run- 
ning speed as an analogy. No genetic program 
specifically limits how fast humans can run, 
but biomechanical constraints on running 
speed are imposed by a fixed body design that 
evolved for other purposes. The absence of 
ageing and death programs opens the door to 
non-genetic interventions that extend health 
and length of life, just as new training methods 
enable us to run incrementally faster. This is 
precisely why modifying behavioural risk fac- 
tors such as diet and exercise does extend the 
period of healthy life, but yields diminishing 
gains in life expectancy. 

Dong and colleagues remind us that human- 
ity is approaching a natural limit to life. This 
limit is now apparent in national vital statis-
tics. Humanity is working hard to manufacture
more survival time, with some degree of suc-
cess, but we should acknowledge that a geneti-
cally determined fixed life-history strategy
for our species stands in the way of radical life 
extension. 


S. Jay Olshansky is in the UIC School of
Public Health, University of Illinois at Chicago,
Chicago, Illinois 60612, USA, and at Lapetus
Solutions, Wilmington, North Carolina.
e-mail: sjayo@uic.edu 
Moon churn


The Moon's surface is being mapped by
NASA's Lunar Reconnaissance Orbiter
spacecraft, to aid planning for future
missions. On page 215, Speyerer et al. report
how images taken by the orbiter’s camera
have been used to quantify the current rate at
which lunar craters form as a result of surface
impacts by comets, asteroids and associated
fragments (E. J. Speyerer et al. Nature 538,
215-218; 2016).

The authors compared pairs of images of 
the Moon's surface taken at different times,
and discovered that 222 craters had formed
in the periods between the images being
taken. They therefore estimate that about
180 craters of at least 10 metres in diameter
form annually across the entire Moon. This is
33% more than would have been expected 
from a commonly used model of impact 
frequency. 
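
The two numbers in this paragraph imply a rough value for what the impact-frequency model would have predicted. The calculation below is only a back-of-envelope inference from the quoted figures (180 craters per year, 33% above the model); the implied value is not a number reported in the text, and the function name is hypothetical.

```python
# Back-of-envelope inference using only the two figures quoted in the text.

observed_craters_per_year = 180  # craters >= 10 m across, whole Moon (from the text)
excess_over_model = 0.33         # "33% more" than the model (from the text)


def implied_model_rate(observed: float, excess_fraction: float) -> float:
    """Rate a model would predict if `observed` exceeds it by `excess_fraction`."""
    return observed / (1.0 + excess_fraction)


model_craters_per_year = implied_model_rate(
    observed_craters_per_year, excess_over_model
)

print(f"Implied model prediction: about {model_craters_per_year:.0f} craters per year")
```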

By calculating the ratios of surface
reflectance between pairs of images,
Speyerer et al. uncovered distinct zones of
subtly modified reflectance around the newly
formed craters. The zones extend many
crater widths out from the centre and are
not visible to the naked eye (pictured are the
ratios for a 12-m crater; dark regions reveal
a zone that splays out up to 1,800 m from
the centre). The authors propose that these
zones are caused by impact-induced jets of
melted and vaporized material formed early
in the crater-formation process.

The researchers also detected thousands
of subtle surface disturbances: changes in
local reflectance that lack a resolvable crater
rim. They interpret many of these as the scars
of secondary impacts that churned up the
upper few centimetres of the surface without
forming a resolvable crater. Speyerer and
colleagues therefore propose that the upper
2 centimetres of loose surface material on the
Moon will be reworked in about 81,000 years,
100 times faster than previously predicted.
Andrew Mitchinson


In retrospect

Fifty years of
C₄ photosynthesis


Half a century after the discovery of a plant photosynthetic pathway termed C₄,
researchers are working to engineer this efficient pathway into crops such as rice
to maintain food security.


JULIAN M. HIBBERD & ROBERT T. FURBANK


Fifty years ago, Hatch and Slack published
an analysis of photosynthesis that gave
birth to a new field. Their work not only
stimulated intense biochemical research to
define the mechanisms of a new photosyn-
thetic pathway, but also fed into many other
disciplines. Ecologists found that the pathway
could explain species distributions. Geolo-
gists gained greater insight into changes in
the isotope composition of sediments and
fossils. And evolutionary biologists started
to investigate the highly complex pathway,
which is found in many plant lineages and is
now considered one of the most remarkable
examples of convergent evolution, a process
in which the same feature evolves indepen-
dently in different unrelated species.

Fifteen years before Hatch and Slack’s work,
Calvin and co-workers had identified the first
photosynthetic pathway by which inorganic
atmospheric CO₂ is incorporated (fixed) into
organic carbon-containing molecules. The ini-
tial step in the pathway produces a molecule that
contains three carbon atoms, and it was widely
thought that all land-dwelling plants used this
‘C₃’ photosynthesis. However, this assumption
was disproved by Hatch and Slack’s carefully
executed experiments. They used the carbon-14
isotope to create ¹⁴CO₂ and then tracked how
the ¹⁴C was incorporated into molecules in
sugarcane plants. Remarkably, they found that
the first step of carbon fixation was actually
into a four-carbon molecule. This alternative
pathway became known as C₄ photosynthesis.
At the time, the significance of Hatch and Slack’s
finding was that two photosynthetic pathways
were now known to operate in plants.

The study by Hatch and Slack explained
some puzzling reports. Laboratories as far
apart as Hawaii and Russia had observed unex-
pected carbon incorporation patterns when
CO₂ was supplied to sugarcane and maize
(corn) leaves. However, Calvin and others
questioned the validity of those reports, and
the findings were not accepted by the field. The
main objection was that ¹⁴CO₂ had often been
introduced to leaves in the dark, when photo- 
synthesis is not active, which risks creating 
artefacts of non-photosynthetic metabolism. 

Hatch and Slack’s key advance was providing
a pulse of ¹⁴CO₂ to leaves in light, followed
by introduction of CO₂ that did not contain
¹⁴C. Such ‘pulse chase’ experiments can track
a ¹⁴C wave as it transits through molecules
in a pathway. The approach showed that the
carbon was first incorporated into malate
(Fig. 1), a molecule containing four carbons,
and then transferred to the three-carbon
molecule 3-phosphoglycerate (also produced by
the enzyme RuBisCO in the C₃ pathway). This
demonstration of carbon flux is still the most
definitive evidence for a C₄ photosynthetic
pathway.

Figure 1 | C₄ photosynthesis. More than 100 years ago, circular structures
known as Kranz anatomy (highlighted in green in the cross-sectional
image on the left) were observed in leaves, but their role was unknown.
In 1966, Hatch and Slack used the ¹⁴C isotope to track the fate of the
carbon (red) of carbon dioxide as it was incorporated into intracellular
organic carbon molecules during photosynthesis in leaves illuminated by
light. They found that the ¹⁴C was incorporated into a malate molecule
containing four carbon atoms. This four-carbon molecule pointed to a new
photosynthesis pathway, which was termed C₄ photosynthesis. Although
C₄ photosynthesis has evolved independently many times across the plant
phylogeny, including in maize (corn), many key crops such as potatoes,
wheat or rice use a less efficient photosynthesis pathway called C₃. A future
goal is to try to engineer this efficient C₄ photosynthesis pathway into other
plants such as rice (or potentially other C₃ crop plants).

Even before Hatch and Slack’s discovery,
there were clues that the physiology of some
plants was different. Tropical grasses often
grow substantially faster, have higher photo-
synthetic rates and use water more efficiently
than other plants. Furthermore, peculiar cir-
cular structures had been spotted more than
100 years ago in some leaves. These struc-
tures, known as ‘Kranz anatomy’ (Kranz is
the German word for wreath), are concentric
circles of mesophyll cells that surround bundle
sheath cells around the veins. The role of Kranz
anatomy became apparent only in the context
of C₄ photosynthesis.

By the 1980s, the fundamentals of the spe-
cialized biochemistry and modified anatomy
of a C₄ leaf were known. All major enzymes
required for the C₄ cycle had been identified,
and the requirement for the C₄ pathway to
be compartmentalized into two different cell
types had been linked to Kranz anatomy. The
C₄ leaf became a model for understanding spe-
cific cells of plants, and researchers proposed
that Kranz anatomy developed as a result of
a diffusible molecule emanating from veins.

In the decades following Hatch and Slack’s
advance, C₄ photosynthesis inspired other
studies in diverse fields. In agriculture, it
explained the high rates of photosynthesis and
low water loss of some crops. The key factor
is the PEPC enzyme, which acts at the start
of the C₄ photosynthesis pathway. PEPC has a
higher binding affinity for carbon than does
the RuBisCO enzyme that acts at a similarly
early step in C₃ photosynthesis. This means
that the stomatal pores that allow atmospheric
CO₂ to enter the leaf don't need to open as
wide in a C₄ plant, so water loss through the
stomata is reduced.

PEPC incorporates ¹³C more readily than
RuBisCO does, and the resulting differences
in leaf carbon-isotope signatures allow species
to be classified as C₃ or C₄ plants in living tissue
or fossils. This approach soon piqued the inter-
est of ecologists and evolutionary biologists.

Ecologists realized that there were clear geo-
graphical distribution gradients of C₃ and C₄
plants, with C₄ plants dominating open habi-
tats in the tropics and subtropics and gradu-
ally becoming less common farther from the
equator. The distribution was linked to the
selective pressures that drive C₄ evolution,
with C₄ plants commonly associated with dry
conditions. Fossil studies provided insight into
ancient environments. The ¹³C content of C₄
leaves led to the discovery that C₄ grasses rap-
idly expanded and came to dominate prairies
and savannahs around 10 million to 6 million
years ago: animals grazing on savannah grasses
produce fossils that have a C₄-type carbon-
isotope signature; those browsing on C₃ trees
produce fossils that have a C₃ isotope signature.

The discovery that C₄ plants arose across
evolutionarily distantly related species
implies repeated evolution of this complex
photosynthetic pathway from the ancestral
C₃ system. Despite the high complexity of the
C₄ system, it has evolved independently many
times, and the number of unrelated groups
of C₄ plants has grown steadily, to around 61
lineages. Several factors might have facili-
tated the repeated evolution of C₄ photosyn-
thesis. For example, proteins of the C₄ pathway
seem to be present in the ancestral C₃ state.
That means that, in principle, C₄ photosynthe-
sis harnesses components already found in C₃
plants. Indeed, part of the C₄ pathway operates
in some C₃ tissues.

The repeated evolution of the C₄ system
continues to intrigue. Although C₄ species are
widely distributed within flowering plants,
there are very few C₄ trees, a phenomenon that
has not yet been fully explained. For nearly
40 years, the C₄ pathway was thought to be
dependent on pathway compartmentalization
between mesophyll and bundle sheath cell
types in the Kranz anatomy, but then several
single-celled C₄ plants were discovered that
broke that rule.

This year, Hatch and Slack attended a
fiftieth anniversary conference of their dis-
covery in Canberra, Australia. They attended
every session and were clearly excited, enter-
tained and at times probably bemused by the
variety of work now going on as a consequence
of their work. Photosynthesis is approximately
50% more efficient in C₄ plants than in C₃ spe-
cies, leading to substantial yield benefits.
Rice, the staple crop for more than half of
the world's population, uses the C₃ pathway.
As a consequence, substantial international
efforts are now under way to understand the
highly complex C₄ system sufficiently to allow
it to be engineered into C₃ crops to increase
their yield.
A map of human
wanderlust


Genetic studies of individuals from geographically diverse human populations 
provide insights into the dispersal of modern humans across the globe and how 
geography shaped genomic variation. SEE ARTICLES P.201 & P.207 & LETTER P.238 


SERENA TUCCI & JOSHUA M. AKEY 


A remarkable feature of modern humans
is our wanderlust, which the poet
Charles Baudelaire famously referred
to as “l’horreur du domicile”. From our evolu-
tionary birthplace in Africa, modern humans
have migrated to nearly every habitable corner 
of Earth (Fig. 1), overcoming obstacles such as 
ice, deserts, oceans and mountains. The num- 
ber, timing and routes of human dispersals out 
of Africa have implications for understanding 
our past and how that past influenced con- 
temporary patterns of human genomic varia- 
tion. Three studies on pages 207, 201 and 238 
(Malaspinas et al., Mallick et al. and Pagani
et al.) describe 787 new, high-quality genomes
of individuals from geographically diverse 
populations, providing opportunities to refine 
and extend current models of historical human 
migration. 

In the past decade, the maturation of whole-
genome sequencing technology has enabled
data to be generated on a scale that was pre- 
viously difficult to imagine. Genome-scale 
studies in humans, such as the 1000 Genomes 
Project, which was completed last year, have
contributed to a catalogue of genetic variation 
and genomic regions that confer the ability to 
adapt to diverse environments. Nonetheless, 
existing genetic data are often constrained 
by several factors, including limited breadth 
of population sampling and low-coverage 
data (in which each region of the genome is 
sequenced only a few times, leading to high 
error rates and missed variants). To address 
this issue, the current studies collect high- 
coverage sequence data for individuals from 
more than 270 populations across the globe. 
By studying the genetic diversity within and 
between these populations, the groups can 
tackle many questions about our past. 

Cataloguing genetic data from indigenous 
populations, which are often difficult to access 
and are rapidly disappearing, is an important 
achievement. Mallick et al. and Pagani et al.
made great efforts to comprehensively sample
regions that are typically under-studied; these 
include African populations, which have con- 
siderable genetic, linguistic and cultural diver- 
sity. Similarly, Malaspinas et al. describe the 
first extensive survey of human genetic diver-
sity in Australia, a poorly studied region that,
together with New Guinea, contains some of 
the earliest archaeological and fossil evidence 
of modern humans outside Africa. 

The high-resolution portrait of human
genetic diversity afforded by these studies allows 
new inferences to be made about our migration 
out of Africa. There are currently two conflict- 
ing models for such human dispersal. The 
first hypothesizes a single event that occurred 
around 40,000-80,000 years ago. Under this 
scenario, all present-day non-Africans trace 
their ancestry to a single population. By con-
trast, the multiple-dispersal model posits that
an initial migration out of Africa occurred as
early as 120,000-130,000 years ago, culmi-
nating in the peopling of southeast Asia and
Australasia, possibly via a southern migration
route along the coastline of the Arabian pen-
insula and the Indian subcontinent. This early
dispersal was followed by a second migration 
from Africa, through the Levant, which resulted 
in the peopling of mainland Eurasia. 

Superficially, the current studies seem to 
come to different conclusions about out-of- 
Africa dispersals. Pagani et al. found that about 
2% of genomes from individuals of Papua New
Guinean ancestry indicate that their ancestors 
separated from Africans earlier than did other 
Eurasians. This observation is consistent with 
a multiple-dispersal model in which an early 
expansion of modern humans from Africa 
led to the peopling of Australasia around 
120,000 years ago. This early out-of-Africa 
migration would have been followed by subse- 
quent dispersals, and would have contributed 
only a small amount of ancestry to present-day
Papuan individuals. Cranial morphology and
other genetic data also support the idea of an
early expansion.

Malaspinas et al. and Mallick et al. con-
sider a different sequence of events, in which
all contemporary non-Africans branched off 
from a single ancestral population. Malaspi- 
nas and colleagues provide evidence that, on 
leaving Africa, modern humans immediately 
separated, leading to two waves of dispersal. 
As previously proposed, one wave led to the
peopling of Australasia, whereas the other 
contributed to the ancestry of present-day 
mainland Eurasians. Mallick and co-workers 
propose that this early separation instead 
occurred between west and east Eurasians, 
meaning that present-day people in Australia
and Papua New Guinea might be descended 
from the same wave as east Asians. 


Figure 1 | A cave painting depicting human migration. Three genetic studies of individuals from
geographically diverse populations provide clues about human history, including when and how many 
times humans moved out of Africa and throughout the world. 
However, neither Mallick et al. nor Malaspi-
nas et al. exclude the possibility of multiple
out-of-Africa dispersals. Indeed, their models 
are consistent with earlier dispersals, as long 
as these early voyagers made little or no con- 
tribution to the gene pool of contemporary 
non-African populations (which is essentially 
what Pagani et al. find). Studies of ancient
DNA clearly show that large-scale population 
turnovers have happened throughout human 
history: populations that once lived in Eurasia, 
for example, vanished without a trace, except 
for their bones. Thus, although some differ-
ences between the proposed models are yet to
be reconciled, they are not as disparate as they 
might seem to be.

The three studies also provide resources 
to better define models of genetic mixing 
between modern humans and their archaic 
hominin relatives, such as Neanderthals 
and Denisovans. Malaspinas and colleagues
propose that the genomes of present-day 
Aboriginal Australians might harbour traces 
of an ancient liaison with an unknown homi-
nin group. Although evidence for gene flow
from an unknown hominin group is tentative, 
it highlights the potentially surprising things 
that can be learnt from a comprehensive 
sampling of human genomic variation. 

These studies fill in some missing pieces 
in the puzzle of human history, but many 
fascinating questions remain. The continued 
sampling of human genomic diversity and 
the development of increasingly sophisticated 
statistical tools promise to reveal more secrets 
about our past. Nonetheless, it is crucial to 
recognize the limits of genetics. As previously 
pioneered, the integration of data across
traditionally distinct disciplines, such as lin- 
guistics, archaeology, anthropology and genet- 
ics, will be necessary to fully retrace the steps 
taken by early humans as they explored and 
colonized the world.
Lethal violence deep in
the human lineage 


Researchers estimate that the incidence of human lethal violence at the time of the 
origin of our species was about six times higher than for the average mammal, but 
about as violent as expected, given our great-ape ancestry. SEE LETTER P.233 


MARK PAGEL 


Are humans naturally violent, as the
seventeenth-century philosopher
Thomas Hobbes thought, with the
prevailing condition of humans being one 
of “continuall feare, and danger of violent 
death”, or as Jean-Jacques Rousseau imag-
ined a century later, neither good nor bad
but moulded by their environments? Social 
scientists have long confronted this ques- 
tion by estimating rates of human violence 
after controlling for factors such as age, sex, 
race and income in large cohorts of individu- 
als drawn from a variety of circumstances. 
On page 233, Gómez et al. adopt a different
approach: they use comparative methods from 
evolutionary biology to reconstruct probable
ancestral rates of lethal violence at the time of 
the origin of our species roughly 160,000 to 
200,000 years ago. 

One of Charles Darwin’s great insights was
that all living things evolve by a process of 
descent with modification, such that species 
give rise to daughter species that inherit many 
of their ancestors’ traits. Comparative biolo-
gists can use the family trees (phylogenies)
that arise from this process to infer the his- 
tory of biological evolution, to date past 
events, and to reconstruct probable ances- 
tral features of species that lived hundreds of 
thousands to hundreds of millions of years ago. 
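
To make the idea of reconstructing an ancestral feature from a family tree concrete, here is a minimal sketch in Python. It assumes a trait that drifts along branches as a simple Brownian motion, which is a common textbook model and not necessarily the one used in the study discussed here. The tree shape, branch lengths and tip values are hypothetical, chosen only for illustration. Working from the tips towards the root, each ancestor is estimated as an average of its two descendants, weighted so that descendants on shorter branches count for more.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Node:
    """A node in a rooted binary tree; tips carry an observed trait value."""
    branch_length: float            # length of the branch leading to this node
    value: Optional[float] = None   # observed trait value (tips only)
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def reconstruct(node: Node) -> Tuple[float, float]:
    """Return (trait estimate, effective branch length) for the subtree at node.

    Assumes Brownian-motion trait evolution. At the root, the estimate is the
    weighted average implied by all tips below it.
    """
    if node.left is None or node.right is None:
        return node.value, node.branch_length
    x1, v1 = reconstruct(node.left)
    x2, v2 = reconstruct(node.right)
    # Weight each descendant by the inverse of its branch length.
    estimate = (x1 / v1 + x2 / v2) / (1 / v1 + 1 / v2)
    # The estimate is itself uncertain, so the branch above is lengthened.
    extra = v1 * v2 / (v1 + v2)
    return estimate, node.branch_length + extra


# Hypothetical toy tree ((A:1, B:1):1, C:2) with made-up trait values (%).
tip_a = Node(branch_length=1.0, value=1.0)
tip_b = Node(branch_length=1.0, value=3.0)
tip_c = Node(branch_length=2.0, value=0.5)
ancestor_ab = Node(branch_length=1.0, left=tip_a, right=tip_b)
root = Node(branch_length=0.0, left=ancestor_ab, right=tip_c)

root_estimate, _ = reconstruct(root)
print(f"Estimated trait value at the root: {root_estimate:.3f}")
```

Real analyses work with far larger trees and account for uncertainty in both the tree and the data; the sketch only shows why closely related species pull an ancestral estimate more strongly than distant ones.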

Gómez and colleagues applied comparative
statistical techniques to a phylogeny of mam- 
mals, which includes primates — the group- 
ing of mammals that comprises monkeys, 
great apes and the lineage that leads to mod- 
ern humans. The authors compiled informa- 
tion on more than 4 million deaths from 1,024 
mammal species drawn from 137 mammalian 
families (80% of the total number of mamma- 
lian families), including mice, horses, bats, 
rabbits and monkeys. Information for humans 
came from 600 studies, and was derived from 
palaeolithic samples (defined by the authors 
as 50,000 to 12,000 years ago), New World and 
Old World Mesolithic (12,000 to 10,200 years 
ago) and Neolithic (10,200 to 5,000 years ago) 
sites, Bronze Age (5,300 to 3,200 years ago) and 
Iron Age (3,200 to 1,300 years ago) samples, 
and anthropological sources from the past few 
centuries. 

The authors then calculated the propor-
tion of deaths attributable to violence from a
member of the same species out of all deaths
counted for each species. Including only acts
of within-species violence is a key point. Lions
and tigers routinely kill members of other
species, but they are less often lethally violent
towards each other. Within-species violence
therefore gets at what Hobbes and Rousseau
disagreed about, and is interesting because
members of the same species typically have all
the same weapons, making violence between
them risky.

Figure 1 | Rates of lethal violence in a mammalian family tree. Gómez et al. used data about causes
of death in more than 1,000 mammalian species to determine for each species the proportion of deaths
caused by members of the same species. These data were grouped into a phylogenetic family tree for
mammals, of which a simplified representation is shown here. The rates of lethal violence against
members of the same species are indicated by a colour scale that ranges from low rates in light red to
higher rates in darker red.

Using the values from contemporary 
species, the authors reconstructed the rate 
of lethal violence (caused by members of the 
same species) at the phylogenetic origin of 
mammals at about 0.30%, which is approxi-
mately 1 in 300 deaths. Rates of lethal violence
then rose steadily over time throughout the 
mammalian phylogeny (Fig. 1) as the recon- 
structed ancestors drew closer to primates: the 
rate is about 1.1% for the ancestor of primates, 
rodents and hares; 2.3% for the common ances- 
tor of primates and tree shrews; then drops 
slightly to 1.8% for the ancestor of the great 
apes. The increases in lethal violence coincide
with species having increasing amounts of 
group living and territoriality. Group living 
places individuals routinely in close contact, 
and territoriality means that groups might 
potentially compete over resources. Gómez
and colleagues reconstructed the incidence 
of human lethal violence at the origin of our 
species at 2%, about six times higher than the 
reconstructed mammalian value. 
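
The percentages in this paragraph are easier to compare when converted to a common form. The short Python sketch below takes the reconstructed rates quoted above and expresses each as "1 death in N" and as a multiple of the value at the origin of mammals. The dictionary labels are shorthand chosen for this sketch, and the text's "1 in 300" and "six times" are rounded versions of the figures the code prints.

```python
# Reconstructed rates of within-species lethal violence, as quoted in the text
# (percentage of all deaths). The labels are shorthand used only in this sketch.
RATES_PCT = {
    "origin of mammals": 0.30,
    "ancestor of primates, rodents and hares": 1.1,
    "ancestor of primates and tree shrews": 2.3,
    "ancestor of the great apes": 1.8,
    "origin of our species": 2.0,
}


def one_in(rate_pct: float) -> float:
    """Convert a percentage of deaths into the N of '1 death in N'."""
    return 100.0 / rate_pct


def fold_change(rate_pct: float, baseline_pct: float) -> float:
    """Return how many times larger rate_pct is than baseline_pct."""
    return rate_pct / baseline_pct


baseline = RATES_PCT["origin of mammals"]
for label, pct in RATES_PCT.items():
    print(
        f"{label}: {pct:.2f}% of deaths, "
        f"about 1 in {one_in(pct):.0f}, "
        f"{fold_change(pct, baseline):.1f}x the mammalian origin value"
    )
```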

Gómez and colleagues’ study risks being
misunderstood, so it is necessary to be clear 
about its interpretation. Humans emerged 
from an evolutionary lineage with a long 
history of higher-than-average levels of lethal 
violence towards members of the same species. 
Even so, followers of Rousseau might step in 
to say that our species’ figure of 2% tells us 
nothing about our innate tendencies; it might 
merely reflect a calculated or environmen- 
tally induced response to the environments in 
which early humans lived. 

Perhaps, but this objection falters when we 
appreciate that species that live in particular 
kinds of environments over long periods of 
time tend to adapt to those environments 
genetically, and this makes some kinds of out- 
comes more likely than others: a wolf raised as 
a sheep will probably one day turn on its fellow 
sheep. For this reason, the authors’ finding of 
a steady increase in violence throughout the 
100 million or so years of the mammalian tree
is important — there was plenty of time for our 
ancestors to acquire and bequeath us genetic 
adaptations towards lethal violence. Our 
nearest living relatives, the chimpanzees, with 
whom we share around 98% identity of our 
gene sequences, form packs to hunt down and
kill stray males from other chimpanzee tribes,
and their hunting parties bear resemblances
to human hunter-gatherer warfare. Even the
usually peaceful bonobo Pan paniscus can
sometimes display violent behaviour (Fig. 2).

Figure 2 | Primate violence. Aggressive behaviour can occur even in the normally peaceful bonobo,
Pan paniscus.

Some will object that it is difficult to derive
reliable estimates of lethal violence. Antici- 
pating this, the authors test for several biases, 
including variation in sample sizes and 
sampling effort, and uncertainty about the 
phylogeny of mammals itself, showing 
that none of these qualitatively alters their 
results. They also find that species we expect
to be violent, such as the predatory carni- 
vores, are violent, and species that we do not 
expect to be violent because they are mainly 
vegetarian, tend not to be. Finally, the authors 
show that rates of violence are heritable in the 
mammalian phylogeny, by demonstrating that 
closely related species tend to have similar 
levels of lethal violence. 

Still, the Rousseau camp might have a corner
to fight. The authors’ estimates of rates of lethal
violence in humans vary widely over time, in 
most cases too quickly to be attributable to 
genetic changes. Their palaeolithic samples 
have rates very close to the 2% predicted at the 
origin of our species, but then rates rise to as 
high as 15-30% (with high statistical uncer- 
tainty) in samples from between 3,000 and 500 
years ago, before declining in contemporary 
populations (approximately 100 years ago 
to the present day). The rise tends to corre- 
late with moving from an early pre-societal 
‘state of nature’ to tribal groupings and then 
to organized political societies that have a 
warrior class. 

Where does this leave us? Social scientists 
take note: the work by Gómez and colleagues
opens up a new approach to uncovering 
the origins of human violence, giving good 
grounds for believing that we are intrinsi-
cally more violent than the average mammal,
and their findings fit well with anthropologi-
cal accounts that describe hunter-gatherer
societies as being engaged in ‘constant
battles’. But societies can also modify our
innate tendencies. Rates of homicide in mod-
ern societies that have police forces, legal
systems, prisons and strong cultural attitudes 
that reject violence are, at less than 1 in 10,000 
deaths (or 0.01%), about 200 times lower than 
the authors’ predictions for our state of nature. 
Hobbes has landed a serious blow on Rousseau, 
but not quite knocked him out.
In vitro and ex vivo strategies for
intracellular delivery 


Martin P. Stewart, Armon Sharei, Xiaoyun Ding, Gaurav Sahay, Robert Langer & Klavs F. Jensen


Intracellular delivery of materials has become a critical component of genome-editing approaches, ex vivo cell-based 
therapies, and a diversity of fundamental research applications. Limitations of current technologies motivate development 
of next-generation systems that can deliver a broad variety of cargo to diverse cell types. Here we review in vitro and 
ex vivo intracellular delivery approaches with a focus on mechanisms, challenges and opportunities. In particular, we 
emphasize membrane-disruption-based delivery methods and the transformative role of nanotechnology, microfluidics 
and laboratory-on-chip technology in advancing the field. 


Despite its essential role in biological research and therapeutic
applications, the efficient intracellular delivery of exogenous
compounds and macromolecular cargo remains a long-standing
challenge. The limitations of established delivery technologies have
hampered progress in multiple areas as the potential of exciting new
materials, insights into disease mechanism, and approaches to cell therapy
are not fully realized owing to their delivery hurdles. This challenge can
be viewed through the lens of two broad parameters: cell type and target
material. Existing technologies are mainly focused on addressing a subset
of combinations, specifically nucleic acid delivery (that is, transfection)
to immortalized cell lines and certain primary cells. Some of the most
exciting target cell types, such as stem cells and immune cells, are also the
most difficult to address. Thus, methods of delivering almost any cargo
molecule to any cell type are much needed.

Although carrier-mediated delivery systems offer promise for nucleic
acid transfection in vivo, membrane-disruption-based modalities are
attractive candidates for universal delivery systems in vitro and ex vivo.
In this review, we begin with motivations driving next-generation intra-
cellular delivery strategies and suggest relevant requirements for future
systems. Next, a broad overview of current delivery concepts covering
salient strengths, challenges and opportunities is presented. Following
that, our focus shifts to prevalent mechanisms of membrane disruption
and recovery in the context of intracellular delivery. Finally, we highlight
the potentially transformative role of nanotechnology, microfluidics and
laboratory-on-chip approaches in shaping the field.


Next-generation intracellular delivery 
Next-generation intracellular delivery solutions are required in 
diverse scenarios ranging from cell-based therapy and gene editing to 
regenerative medicine and fundamental biology (Fig. 1). Ex vivo cell- 
based gene therapies have shown promise in clinical trials against human 
disease, with exciting examples including self-renewing haematopoietic 
stem cells and cells for immunotherapy. In haematopoietic stem cells,
gene therapy to correct mutations in monogenic diseases such as severe
combined immunodeficiency (SCID)-X1, Wiskott–Aldrich syndrome,
and β-thalassemia has been achieved. For T cells, novel function against
tumour targets can be instructed by induced expression of specific
T cell receptors and chimaeric antigen receptors followed by adoptive
cell transfer. Non-essential proteins used by pathogenic processes can
be deleted by delivery of genome-editing nucleases, as was recently
demonstrated with ablation of the HIV-dependent CCR5 receptor
necessary for infection in T cells or the haematopoietic lineage.
Moreover, induced secretion of cytokines, or programmed drug
resistance and safety switches, can be engineered into these cell types
by ex vivo manipulation. Recent breakthroughs in gene editing with
programmable nucleases offer an unparalleled opportunity to reach
many of these goals. Further afield, in regenerative medicine the
importance of not relying on potentially mutagenic viral vectors for
induced pluripotent stem cell production has spawned reprogramming
efforts by direct delivery of proteins, messenger (m)RNA transfection,
and micro (mi)RNA delivery, among other possibilities. A common
theme in these clinically relevant examples is the need to perform efficient
and safe intracellular delivery.

In basic research, nucleic acids, proteins, peptides, metabolites, 
membrane impermeable drugs, cryoprotectants, exogenous organelles, 
molecular probes, nanodevices and nanoparticles are all potential target 
materials for intracellular delivery. Despite their limitations, current 
delivery capabilities, largely centred around nucleic acid delivery to cell 
lines, have already yielded dramatic progress with plasmid (p)DNA and 
mRNA for gene expression and small interfering (si)RNA and miRNA 
for gene silencing. Meanwhile, systematic delivery of protein biologics
into living cells, such as active inhibitory antibodies and stimulatory
transcription factors, represents a powerful yet largely untapped tool for
decoding and engineering cell function. Measurement of intracellular
chemical and physical properties with innovative devices, sensors
and probes is another frontier. Probes engineered from functional
nanomaterials (including nanoplasmonic optical switches, carbon
nanotubes and quantum dots) have generated excitement in
research communities for decades, but ineffective intracellular delivery,
a poor understanding of their interaction with biological environments,
and toxicity issues have held back their deployment in the cellular
context. These delivery challenges are particularly acute in the case of
important patient-derived cell types such as immune cells, stem cells and
neurons.

Taken together, the preceding examples (Fig. 1) provide compelling
motivations and requirements for next-generation intracellular delivery
systems. In Table 1 we propose a set of guidelines to be considered by
inventors. Regarding cell-based therapies, for example, decades of
clinical trials indicate that the risks of ex vivo culture include karyotype
abnormalities, genotoxicity, and exhaustion of proliferative potential.
Hence rapid and safe delivery protocols that maximize efficiency and cell
viability while minimizing time in culture are crucial. For gene editing,
a key issue is that transient ‘hit and run’ exposure of nucleases is often
more favourable than indirect expression from DNA or mRNA, because
it yields more control over dosage concentration and efficacy, and reduces
off-target effects. For induced pluripotent stem cells, delivery challenges
may impede the use of the ideal combination of proteins, nucleic acids,
and small molecules to provide the optimal reprogramming outcome.
Furthermore, low-cost transfection remains a barrier in many sensitive
cell types at the level of basic research, and is even more problematic
when considering scale-up for clinical protocols or industrial processes.
Therefore, next-generation intracellular delivery strategies must strive
to address the decades-old challenge of delivering diverse cargoes to the
intracellular space of a wide range of cell types.

Figure 1 | Intracellular delivery is a key step in investigating and
engineering cells. The schematic depicts examples of application areas
and molecular tools that require intracellular delivery for their realization.
The subsets are not mutually exclusive. For example, gene editing may
be employed in cell-based therapies, regenerative medicine, and genetic
modulation. iPSC, induced pluripotent stem cell; TALEN, transcription-
activator-like effector nuclease; ZFN, zinc finger nuclease; ssODN, single-
stranded donor oligodeoxynucleotides.

Table 1 | Ideal features of next-generation intracellular delivery systems

Feature: Minimal cell perturbation
Justification:
• The exogenous vectors, materials or physical forces required to facilitate delivery can lead to off-target effects and toxicity
• Prolonged culture duration associated with delivery and verification can lead to unintended fate or phenotypic changes, loss of
proliferative or homing potential, genotoxicity, and karyotyping abnormalities and accumulated mutations
• By minimizing the physical or biochemical manipulation necessary to achieve delivery, one can reduce undesirable side effects and
maximize efficacy of the delivery process

Feature: Scalability
Justification:
• Effective delivery systems must be amenable to implementation at different scales of throughput
• Studying a rare cell subset may require only 100 cells per sample whereas an adoptive-transfer T-cell therapy can involve transfer
of over 10^8 cells per patient
• Considering processes complying with current good manufacturing practice (cGMP) at an earlier stage may accelerate clinical
translation

Feature: Universal across cell types
Justification:
• An ideal delivery system should be able to accommodate the diversity of physical and biological properties of potential target cells
to ensure efficacy in the applications of interest; this could be accomplished through exploiting relatively universal mechanisms to
facilitate delivery

Feature: Material independent
Justification:
• Delivery materials of interest may have diverse chemical and physical properties
• To facilitate robust delivery, an ideal delivery system should rely on a delivery mechanism that is independent of material
properties (for example, if a delivery technology relies on electrophoresis to facilitate delivery it may not be compatible with
uncharged materials)

Feature: Compatible with intracellular targeting
Justification:
• Distinct materials have different target sites within the cell (for example, siRNA and mRNA should facilitate the desired gene
knockdown or expression effects in the cytosol, whereas DNA transcription requires nuclear localization)
• A preferred delivery system must be compatible with various intracellular targeting strategies; specifically, it should provide robust
delivery of material to the cytosol and not interfere with targeting motifs (for example, a nuclear localization peptide sequence) on
the material

Feature: Dosage control
Justification:
• The ability to control the maximum and minimum intracellular concentration may be key in some applications
• Strategies that rely on indirect delivery by expression from nucleic acids are subject to cell-to-cell variation and inherent signal
amplification associated with transcription

Feature: Cost
Justification:
• Cost and complexity of production/operation can limit the utility of a delivery technology
• Effective delivery systems should use scalable, cost-effective designs that are amenable to clinical translation, compliance with
cGMP standards, and large-scale manufacturing


Many roads to one destination 
Intracellular delivery can be achieved by a range of carrier-based or 
membrane-disruption-based techniques (Fig. 2). Membrane-disruption 
modalities are primarily physical, involving the introduction of transient
discontinuities in the plasma membrane via mechanical, electrical,
thermal, optical or chemical means. These approaches can be thought
of as permeabilization or direct penetration modalities. A cell becomes
permeable to a substance when disruptions in the membrane are of
sufficient size to allow passage through the membrane. Alternatively,
direct penetration employs a solid conduit or vehicle to concurrently
penetrate the membrane and introduce cargo. Carrier-based approaches
comprise various biochemical assemblies, mostly of molecular to 
nanoscale dimensions. The purpose of carriers is threefold: (1) to package 
the cargo and protect it from degradation, (2) to gain access to the 
intended intracellular compartment, and (3) to release the payload with 
the appropriate spatiotemporal dynamics. Carriers can be bio-inspired, 
such as reconstituted viruses, vesicles, cell ghosts, and functional ligands 
and peptides. They may be based upon synthesis techniques from 
chemistry, materials science and nanotechnology, involving assembly of 
nanoparticles and macromolecular complexes from organic and inorganic 
origins. Most carriers enter through endocytosis but some may exhibit 
fusogenic potential, endowing them with the ability to merge directly
with the target membrane. 

In the case of nucleic acid delivery (that is, transfection), vectors are 
defined as constructs that contain foreign DNA for the purpose of expres- 
sion or replication. Major vector types are plasmids, cosmids, episomes/ 
artificial chromosomes, and viral vectors, of which only viral vectors are 
capable of unassisted entry. Viral vectors exploit the viral infection path-
way to enter cells but avoid the subsequent expression of viral genes that
leads to replication and pathogenicity. This is done by deleting coding
regions of the viral genome and replacing them with the DNA to be
delivered, which either integrates into host chromosomal DNA or exists
as an episomal vector. At present, viral vectors are the most clinically
advanced nucleic acid delivery agents owing to their high efficiency and
specificity. They have been implemented in clinical trials for decades,
being viewed as a promising approach for gene therapy. In 2015, more
than half of the gene therapy submissions to the Food and Drug Administration
(FDA) relied on viral vector platforms based on lentivirus, retrovirus,
adenovirus, or adeno-associated virus. For ex vivo applications, lentiviral
transduction of the haematopoietic lineage is a prominent example.
However, challenges such as immune response, safety and complexity of
preparation are concerns for viral vectors, and so viral systems have
struggled to gain FDA approval. To address these issues, researchers are
developing new vectors and serotypes. Table 2 presents a comparison of
strengths, challenges and opportunities for viral vectors, non-viral carriers 
and membrane-disruption-based delivery. 

Figure 2 | Map of the relationship between intracellular delivery
approaches, basic mechanism and conventional physical and
biochemical categorizations. Physical techniques produce membrane
disruption either via permeabilization or direct penetration, while
biochemical assemblies and viral vectors act as carriers to shuttle
cargo through endocytosis. If a carrier has fusogenic potential, it may
also enter through membrane fusion. Some biochemical approaches,
such as detergents and pore-forming proteins, work via membrane
permeabilization. Schematics at the top show the four subcategories with
molecular cargo (grey), membrane (double lines), and carrier material
(purple).

Table 2 | Strengths, challenges and opportunities for intracellular delivery approaches

Strengths

Carriers, viral vectors:
• Amenable to both in vivo and in vitro translation
• Experience gained from advanced status in clinical trials
• High efficiency of intracellular delivery due to viral exploitation of infection pathway

Carriers, non-viral:
• Amenable to both in vivo and in vitro translation
• Packaging of delivery material can protect payload from premature degradation and potentially enable more efficient use of
valuable materials
• Capable of cell-specific and intracellular targeting
• Passive, potentially high throughput

Membrane disruption:
• Delivery of diverse materials
• Capable of addressing many cell types
• Optionally vector-free (that is, non-immunogenic)
• Transient, defined exposure to membrane disruption
• Rapid, almost instantaneous delivery

Challenges

Carriers, viral vectors:
• Can trigger adverse host immune response
• Limited to nucleic acid delivery (transduction)
• Limited genome size
• Many are cell-cycle dependent
• Preparation may be expensive, time consuming, require extensive experience, and demand special safety measures
(for example, BL2)
• For integrating viruses, risk of genotoxicity
• Limited tropism may restrict target cell types
• Manufacturing challenges for scaling up, for example, quality control for vector potency

Carriers, non-viral:
• Inefficient and slow delivery, especially for carriers that enter via endocytosis (about 1% endosomal escape)
• Carrier materials may perturb cell function or cause toxicity in unpredictable ways
• In vivo targeting outside the liver has been difficult
• Often restricted to delivering particular types of cargo (such as nucleic acids)
• Unpacking kinetics may be unfavourable
• Complex, laborious and expensive biochemistry or materials synthesis may be required for carrier preparation and
manufacturing

Membrane disruption:
• Loss of cytoplasmic content
• Some modalities (such as thermal and electric) can lead to excessive damage to organic molecules, protein denaturation,
and internal membrane breakdown
• Some approaches (such as microinjection) currently not amenable to high throughput
• Some methods can be restricted to adherent-only or suspension-only cells
• Less amenable to in vivo translation

Opportunities

Carriers, viral vectors:
• Less-immunogenic vectors or evasion of immune response
• Programmable tropism and specificity
• Further development of hybrid viral serotypes
• Potential of alternative viral species for new vector development
• Improved production methods

Carriers, non-viral:
• Novel carriers that can efficiently co-deliver diverse cargos (for example, proteins + nucleic acids)
• Manipulation of target cell biology to regulate membrane trafficking and increase delivery efficiency
• Stimuli-sensitive nanocarriers
• Leveraging direct fusion to bypass endocytosis (for example, exosomes)
• Implementation of biomimetic functionality inspired by viruses, bacterial machinery, and exosomes

Membrane disruption:
• Microfabrication and nanotechnology enables fine control of physical phenomena and makes modalities that were
previously intractable more feasible
• A better understanding of cell recovery processes may allow mitigation of toxicity and functionality issues
• Can be combined with carriers designed for subcellular targeting and controlled release
• Engineered switchable valves in plasma membrane to dynamically control permeability
• Versatile potential for in vitro and ex vivo applications

See text for further details and references.

Motivated by the limitations of their viral counterparts, hundreds of
non-viral vectors and synthetic carriers have been designed, using vast
combinations of lipid, polymer, and inorganic nanomaterials, sometimes
featuring functionalization with ligands, cell-penetrating peptides and
other targeting or stabilizing agents. Most carriers are
designed for nucleic acid transfection, but recent efforts seek to expand
their ability to co-deliver proteins or other biomolecules. Nearly all
carriers are taken up via specific endocytic pathways based on their
cell surface interactions and physicochemical properties. To reach
the intended intracellular target, the cargo must escape endosomal 
progression, which otherwise leads to degradation in lysosomes or 
regurgitation back to the cell surface. For lipid nanocarriers, which are
considered the most advanced non-viral vectors for nucleic acid delivery,
quantitative studies reveal that approximately 1% of the nanocarriers
escape from endosomes. The exact mechanisms of escape remain
elusive, however. Proposed explanations include endosome disruption,
either by formation of transient lesions or vesicle lysis; active transport of
dissociated products; or fusion of carriers or multi-vesicular bodies with
the outer limiting membrane. Apart from endosomal escape, another
consideration is the kinetics of cargo release from the carrier, where
delayed unpacking has been reported as a bottleneck to transfection
efficiency. Moreover, toxicity of carrier material and perturbation to
membrane trafficking processes have been noted. Manipulation of
the host cell biology, using small molecules for example, represents an
opportunity for boosting endosomal escape and delivery efficiency.
The design of stimuli-sensitive nanocarriers that respond to selective
endosomal or intracellular conditions could also lead to improvements.

Carriers with fusion capabilities circumvent endocytosis by releasing 
their cargo directly into the cytoplasm. These systems were first inspired 
by viruses that deploy specialized surface proteins to induce fusion with 
target membranes. Fusogenic carriers are bound by a phospholipid
bilayer that hosts the fusion machinery. Examples include cell ghosts, dead
cells that have their cytoplasm replaced with cargo, and virosomes,
loaded vesicles reconstituted to present functional viral proteins. More
recently, cell-derived vesicles known as exosomes have been discovered 
to fuse with target cell membranes for the exchange of RNA and proteins 
between immune cells. Although the exact fusion mechanisms are yet to
be elucidated, such bioinspired systems may represent a new generation
of vehicles with which to overcome the poor efficiency and toxicity of
synthetic carriers.

An inherent limitation of carrier systems is the restricted combination 
of feasible cargo materials and cell types. Target cells may not exhibit 
the appropriate receptors, surface interactions, endocytic activity, or 
endosomal escape pathways. Furthermore, potential cargo materials 
often display enormous variability in their properties, such as charge, 
hydrophobicity, size, mechanical properties, composition, and functional 
groups. They may not efficiently complex with the carrier, tolerate 
packaging, unpack properly, or be amenable to delivery in sufficient 
quantities for a given application. For example, cationic lipids readily
form complexes with anionic nucleic acids to transfect most immortalized
cell lines, but many blood and immune cells remain recalcitrant.
On the other hand, cell-type-specific uptake can be a characteristic
deliberately employed to achieve controlled targeting of a cell
population. Furthermore, owing to the mechanism of passive dispersion,
carrier-based delivery is scalable and amenable to high throughput, with
the ability to concentrate and protect limited amounts of cargo material
for potent delivery. Related to these strengths, lipid nanoparticles and
more minimalistic, compact conjugates are now in human clinical trials
for in vivo delivery of therapeutic siRNAs.

Unlike carriers, membrane-disruption-based approaches are less 
dependent on cargo properties, being able to deliver almost any sub- 
micrometre material dispersed in solution. The ability to rapidly switch 
membrane-perturbing effects on and off enables temporal control and 
rapid, almost instantaneous delivery. A further strength of membrane- 
disruption techniques in vitro and ex vivo is the broad range of cell types
and materials that can be addressed. Electroporation, for example,
has a reputation for transfecting primary cell types that are otherwise
recalcitrant to lipid nanoparticles and other non-viral transfection
agents. Membrane-disruption-based approaches may furthermore
be combined with carriers to synergize the strengths of both, such as
by delivering a nuclear-targeted DNA lipoplex to the cytoplasm.
Membrane-disruption-based delivery has also enabled several
protein-delivery applications, featuring antibodies, transcription factors
and genome-editing nucleases. In primary human haematopoietic
stem cells and T cells, for example, it was found that expression of Cas9
nuclease and guide RNA from plasmids was poorly tolerated, while
direct delivery of Cas9-sgRNA complexes via electroporation improved
efficiency, reduced off-target effects and normalized dosage control.
Such results highlight a trend towards direct delivery of macromolecules 
rather than their indirect expression from vectors. 

Traditionally, key weaknesses of membrane-disruption strategies have
been (1) the inconsistent level of cell-to-cell plasma membrane injury,
with too little rendering insufficient delivery and too much causing
excessive cell damage; (2) poor throughput and scalability (for example,
microinjection); and (3) inadequate understanding of cell recovery,
resulting in inefficient protocols. Methods that employ severe
thermal shock or electric fields may denature proteins or damage cell
components. Moreover, because of how membrane perturbation is
administered in these techniques, they are often restricted to adherent
or suspension cells. To overcome such challenges, new technologies are
reinvigorating old approaches and concepts, suggesting that this may be
a critical time in the development of in vitro and ex vivo intracellular
delivery approaches (Box 1). Membrane disruption has seen promising
advancement in recent years through nanotechnology, microfluidics, 
and laboratory-on-chip devices. Next, we cover the fundamentals 
of membrane disruption and repair before highlighting examples of 
technological progress in the field. 


The make and break of a cell membrane 
The plasma membrane can be perturbed by physical means, with
mechanical force, thermal deviations, electromagnetic radiation
and electric fields, or by appropriate biochemical agents, such as
membrane-active peptides, detergents and pore-forming toxins (Fig. 3a).
A relatively straightforward mode is mechanical disruption, where
in-plane tensile strains of 2%-3% rupture a lipid bilayer. Such
mechanical disruption can be administered through solid contact,
fluid shear, or hydrostatic/osmotic pressure. Depending on the
contact area and strain rate, the applied force may either disrupt the
membrane immediately, or first deplete membrane reservoirs. For
example, sharp objects like microneedles concentrate the force to a small
region and presumably penetrate rapidly, while the ‘blunt’ and relatively
slower onset of an osmotic shock tends to deplete global reservoirs before
producing disruptions. Thermal deviations may promote membrane
defects through several mechanisms. First, the higher kinetic energy 
associated with supraphysiological temperatures gives rise to more intense 
molecular fluctuations and subsequent dissociation of lipids. In live 
cells it has been shown that leakage begins at 42°C while temperatures 
above 55°C promote rapid exchange of low-molecular-weight (<1 kDa) 
molecules. Second, at the opposite end of the spectrum (<0°C),
formation of ice crystals can trigger mechanical expansion and cracking
of the cell membrane, which may be reparable upon thawing. Third,
within the physiological range (0-40°C), passing the membrane rapidly
through thermal phase transitions may lead to the generation of holes,
especially at phase domain boundaries. In electroporation, the electric
charging of non-conducting lipid bilayers and ever-present thermal 
fluctuations conspire to create and expand a heterogeneous population 
of pores. As membrane potential rises beyond 0.2 V, hydrophobic defects
of thermal origin readily transition over their energy barrier and expand
into hydrophilic pores of 1 nm or more. Because small pores are poor
conductors, their growth is energetically favourable while the field
is maintained but slows down when they grow to a certain size.
Depending on the orientation of the electric field, these small pores may
be unevenly distributed across the cell surface. Alternatively, optoporation
with lasers produces a hole at a discrete point on the cell surface,
and may involve a combination of mechanical, thermal and chemical
effects, depending on pulse parameters. Possibilities include thermal
dissociation, thermoelastic stresses, effects beyond the focal region 

The field of intracellular delivery arguably began with the advent of
microinjection in 1911. Since then a broad range of options have evolved,
which can be categorized into carrier-mediated or membrane-disruption-
mediated (Fig. 2). The timeline highlights trends in the field.


For carrier-mediated approaches, early research noted that several cationic
compounds readily complex with the negatively charged nucleic acids to
facilitate uptake of DNA and RNA. Examples include precipitates formed with
diethylaminoethyl (DEAE)-dextran and calcium phosphate. Inspired by these
initial findings, chemical complexes and modified viruses were subsequently
deployed as tools for DNA transfection. Since then hundreds of viral, lipid,
polymer, and inorganic carriers have been developed, mostly for nucleic acid
transfection. Recent efforts have focused on multifunctional capabilities
conferred through nanotechnology and biomimetic strategies such as
exosomes, direct conjugates, and new generations of viral vectors.


Membrane-disruption-based approaches have evolved in parallel. Initially,
available options included low-throughput microinjection or membrane
perturbation with hypotonic shock. The demonstration of DNA transfection
by electroporation in 1982 sparked an era of widespread experimentation
with other membrane-disruption modalities, including laser optoporation,
scrape/bead loading, syringe loading, acoustic sonoporation,
ballistic particles, and permeabilization with detergents and pore-forming
agents. Electroporation rapidly gained a foothold as commercial products
were launched from the mid-1980s. Most other membrane-disruption
techniques were not broadly adopted, presumably owing to poor cell recoveries,
limited throughput, need for specialized equipment, high cost, or
skill-dependent operation. In the past decade, nanotechnology, microfluidics,
and laboratory-on-chip platforms have been emerging as new possibilities to
reinvigorate membrane-disruption-mediated approaches.

Figure 3 | Membrane disruption and recovery in the context of
intracellular delivery. a, Schematic of plasma membrane disruption
by mechanical forces (solid contact and fluid shear), thermal effects,
focused lasers, electric fields, pore-forming agents that assemble into
complexes, and the action of detergents that solubilize membrane lipids.
b, Selected mechanisms of plasma membrane repair in the literature. Small
disruptions (<100 nm) can be removed via endocytosis (i) and exocytosis
or shedding of blebs (ii). Repair of large disruptions involves patching with


(such as shock-wave emission and shear stresses from induced cavitation
bubbles) and generation of low-density free-electron plasma and reactive
oxygen species. The latter leads to chemical degradation via peroxidation
or reactive fragmentation of biomolecules. Membrane defects from
continuous-wave lasers are thought to arise from local heating, while
nanosecond lasers produce a combination of heating, bubble formation
and thermoelastic stresses. Femtosecond laser mechanisms are tunable
on the basis of irradiance strength and repetition frequency, ranging from
almost purely chemical effects to combinations of chemical and thermal
degradation.
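
As a rough illustration of the 0.2 V electroporation threshold mentioned above, the sketch below estimates the external field needed to bring the poles of a cell to that membrane potential. It assumes the steady-state Schwan approximation for an isolated spherical cell, ΔV = 1.5·E·r·cos θ, which is a standard textbook relation and is not taken from this review. The function names and the 10 µm radius are illustrative assumptions, and the resting potential of the cell is ignored. The cos θ term is also one simple way to see why pores may be unevenly distributed with respect to the field direction.

```python
import math


def induced_membrane_potential(field_v_per_cm, radius_um, theta_deg=0.0):
    """Steady-state Schwan estimate of the induced transmembrane voltage (V).

    Assumes an isolated spherical cell; theta is the angle between the
    field direction and the point on the membrane being considered.
    """
    radius_cm = radius_um * 1e-4  # 1 um = 1e-4 cm
    return 1.5 * field_v_per_cm * radius_cm * math.cos(math.radians(theta_deg))


def field_for_threshold(threshold_v, radius_um):
    """External field (V/cm) at which the cell poles reach threshold_v."""
    return threshold_v / (1.5 * radius_um * 1e-4)


if __name__ == "__main__":
    # Hypothetical 10 um radius cell and the 0.2 V threshold quoted in the text.
    field = field_for_threshold(0.2, radius_um=10.0)
    print(f"Field needed at the poles: {field:.0f} V/cm")
    # At the equator (theta = 90 degrees) the induced potential vanishes.
    print(f"Potential at equator: {induced_membrane_potential(field, 10.0, 90.0):.3f} V")
```

Under these assumptions, smaller cells require proportionally stronger fields to reach the same membrane potential, which is consistent with the general need to tune pulse parameters per cell type.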

Various biochemical agents have been used to permeabilize cells, the
most eminent of which are detergents and pore-forming toxins.
Pore-forming toxins approach the cell membrane as soluble agents, bind to
the cell surface, oligomerize, and insert as an assembled pore complex.
The most often reported is Streptolysin O, owing to its ability to generate
large pores of over 30 nm in diameter for the passage of proteins and
large molecules. Alternatively, detergents act by solubilizing membrane
components. The amphiphilic plant glycosides digitonin and saponins are
the most popular detergents for reversible permeabilization of cells.
Although the exact mechanisms are still a matter of study, it is known
that digitonin and saponin interact with membrane cholesterol, making
them specific for the cholesterol-rich plasma membrane. However, the
reported pore sizes are inconsistent, varying from a few nanometres up
to the micrometre scale.

Upon plasma membrane injury, the cell must either reseal or die. Along
with the intended influx of cargo, there is the entry of Ca²⁺, and the efflux
of K⁺, proteins, amino acids, metabolites, ATP, and other cytoplasmic
contents to contend with. Inundation with reactive oxygen species and
other toxic molecules may cause further damage to endogenous proteins
and biomolecules. Thus, cells urgently deploy repair pathways to reseal

endomembrane from intracellular vesicles (contents in light blue) (iii).
See recent reviews for further details. c, Timescales of intracellular
delivery via membrane disruption. Initially, membrane disruption permits
the exchange of intracellular and extracellular contents, including cargo
(green). Upon repair of membrane barrier function, delivered materials
are retained while cellular recovery processes work to restore cytoplasmic
homeostasis. PFT, pore-forming toxin.


membrane disruptions and recover from the damage imposed (see
recent reviews). This repair is an active process, primarily triggered
by the influx of Ca²⁺ down its 10,000-fold concentration gradient.
There is a marked consistency in the properties of the repair pathways,
regardless of whether the source of damage is electrical, mechanical,
optical, or even chemical. Instead, membrane recovery is thought to
depend on disruption size, collateral damage, temperature, composition
of the extracellular medium, and cell type. Up to six membrane repair
pathways have been proposed, primarily involving membrane-trafficking
processes around the defect. As the exact mechanisms remain
controversial, we illustrate three broad concepts here (Fig. 3b).
First, Ca²⁺-dependent exocytosis and fusion of membrane-proximal
vesicles leads to resealing by patch formation, but may also serve
to reduce tension around the wound, or release lysosomal signals for
membrane remodelling. Second and third, for smaller disruptions
of several hundred nanometres or less, endocytosis or exocytosis,
respectively, may extract the lesion into a disposable vesicle. The
timescales of these repair processes range from a few seconds to
several minutes.
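
To give a sense of the driving force behind the calcium influx described above, the following sketch converts the quoted 10,000-fold concentration gradient into an equilibrium (Nernst) potential. The Nernst equation, the divalent charge of the calcium ion and the 37 °C temperature are assumptions brought in for illustration; they are not stated in the text, and the calculation ignores the resting membrane potential, which would add to the inward driving force.

```python
import math

GAS_CONSTANT = 8.314462618   # J/(mol*K)
FARADAY = 96485.33212        # C/mol


def nernst_potential_mv(conc_ratio_out_over_in, valence, temp_c=37.0):
    """Equilibrium potential (mV) for an ion with the given outside/inside ratio."""
    temp_k = temp_c + 273.15
    volts = (GAS_CONSTANT * temp_k) / (valence * FARADAY) * math.log(conc_ratio_out_over_in)
    return 1000.0 * volts


if __name__ == "__main__":
    # 10,000-fold gradient from the text; valence 2 for calcium.
    e_ca = nernst_potential_mv(1e4, valence=2)
    print(f"Nernst potential for calcium: about {e_ca:.0f} mV")
```

A potential of this size (on the order of 120 mV at body temperature) suggests why even a brief, small disruption may admit enough calcium to act as a repair trigger.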

Taken together, to achieve effective membrane-disruption-based
delivery, the disruption must be sufficient to introduce the intended
cargo, yet the cell must be capable of repairing itself without permanent
damage. The membrane-disruption step usually occurs on a sub-second
timescale; molecules then diffuse, or are driven, into the cell while repair
and the associated contraction of holes takes place over seconds to
minutes (Fig. 3c). This is followed by a longer phase involving restoration
of cytoplasmic composition, stress response and possible alterations
in transcription. The influx and retention of molecules will depend on
the size, lifetime and distribution of disruptions, as well as the properties
of cargo molecules and their interaction with the cell. Electroporation,
for example, provides an electrophoretic force that may boost the influx
of certain charged molecules.


Towards precision membrane disruption 
For decades, a leading delivery technique has been electroporation, with 
its ability to introduce diverse biomolecules to millions of cells per run. In 
a conventional setup, a solution with suspended cells is dispersed between 
parallel plates that apply a series of electrical pulses of determinable 
voltage, duration, waveform and frequency. Compared to bulk
electroporation setups, microfluidic designs offer the ability to localize
the electric field to the scale of the cell (Fig. 4a). They can reduce the
required voltage (often by 100-fold), provide superior heat dissipation,
and incorporate flexible design features, such as hydrodynamic focusing
to distance cells from potentially damaging electrodes. An elegant and
uncomplicated design was reported by ref. 79, who combined constant
voltage with cells flowing through constrictions. The pulse strength was
dictated by the cross-sectional ratio between the main channel and the
constrictions, while the pulse duration was determined by speed of
passage, thus avoiding the need for a pulse generator. Electroporation
with a constant direct-current voltage has also been demonstrated for
cells in aqueous droplets. Owing to the non-conductivity of oil, cells
only experience a transient electric pulse when the conductive droplets
pass the electrodes. At the nanoscale, ref. 81 described a nanochannel
electroporation system that uses a ~90-nm aperture to localize the
permeabilization to a single point on the cell surface. This innovation
enables the generation of a single large hole rather than the numerous
small pores characteristic of conventional electroporation, which is less
amenable to free passage of large materials. For example, conventional
electroporation appears to exploit the charge of nucleic acids, such as
plasmids and mRNA, in order to partially embed them in pores, resulting
in subsequent internalization through active membrane-trafficking
pathways rather than direct delivery. In contrast, nanochannel
electroporation achieves improved dose control, enhanced electropho- 
retic delivery deeper into the cell, and the ability to deliver materials that 
bulk electroporation often cannot, such as quantum dots. 
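
The constant-voltage constriction design described above can be sketched numerically. The snippet below is a minimal model, assuming uniform buffer conductivity so that current continuity makes the field scale with the ratio of cross-sectional areas, and assuming the cell moves through the constriction at constant speed. The function name and all numerical values are hypothetical and are not taken from ref. 79.

```python
def constriction_pulse(field_main_v_per_cm, area_main_um2, area_constriction_um2,
                       constriction_length_um, cell_speed_um_per_ms):
    """Return (field in constriction [V/cm], exposure time [ms]).

    Idealized model: same current through main channel and constriction,
    uniform conductivity, constant cell speed inside the constriction.
    """
    if min(area_main_um2, area_constriction_um2, cell_speed_um_per_ms) <= 0:
        raise ValueError("areas and speed must be positive")
    # Pulse strength: field is amplified by the cross-sectional ratio.
    field_constriction = field_main_v_per_cm * area_main_um2 / area_constriction_um2
    # Pulse duration: time the cell spends inside the constriction.
    duration_ms = constriction_length_um / cell_speed_um_per_ms
    return field_constriction, duration_ms


if __name__ == "__main__":
    # Hypothetical geometry: 10x area reduction, 500 um long constriction.
    field, duration = constriction_pulse(100.0, 1000.0, 100.0, 500.0, 50.0)
    print(f"{field:.0f} V/cm for {duration:.1f} ms")
```

In this picture the channel geometry sets the pulse amplitude and the flow rate sets the pulse width, which is why no pulse generator is needed.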

Classic options for membrane disruption via mechanical perturbation
include scrape and bead loading of adherent cells or syringe loading of
cells in suspension by repeated aspiration and expulsion through
small-gauge needles. These methods provide coarse, inconsistent damage over
a target cell population while being high throughput and low cost. On
the other hand, modern microfabrication technology is now enabling
mechanical approaches with improved precision (Fig. 4b, c). Prominent
examples include cell squeezing, nanoneedles, and exploding cavitation
bubbles. Cell squeezing involves the rapid deformation of cells as they
pass through microfluidic constrictions of around half to one-third
of the cell's diameter. Diffusive delivery of a variety of cargoes including
proteins, nucleic acids, quantum dots, carbon nanotubes and other
nanomaterials has been demonstrated. A major strength of squeezing is the
simplicity of the device, with no moving parts or need for an external
power supply. The energy for membrane disruption comes from the flow
through a static structure. Although the technology has shown
applicability across dozens of cell types at throughputs up to a million cells
per second, a current limitation is the correlation between cell size and
delivery efficiency. For an asynchronous population, cells that are too
large, such as multinuclear cells, may be lysed. Conversely, cells that
are too small do not experience sufficient deformation to disrupt the
membrane properly. This has prompted the design of various constriction
geometries to address different sizes of cell.

Nanoneedles are another key development over the last decade,
involving the generation of nanometre-scaled features capable of
penetrating the cell membrane and providing access to the cytosol. In
the reported modalities thus far, the target material is delivered through
one of three modes: (1) dissociation from the penetrating structure upon
cytosolic entry; (2) direct injection through a hollow nanoneedle
or “nanostraw”; or (3) diffusion from the extracellular medium
through holes after withdrawal of the needles. For the first mode, the
delivery of siRNA, peptides, DNA, proteins, and impermeable inhibitors
to challenging cell types such as neurons and immune cells has been
demonstrated. Studies of the mechanism suggest that puncture does
not occur upon initial cell contact, but requires active forces generated
by cell spreading and formation of tension-promoting focal adhesions.
In the second mode, molecules have been successfully pumped into cells
through nanostraws. A key benefit of this configuration is temporal
control over delivery dynamics, volume, and dosage concentration, as
well as possible gating with electric fields. In the third mode, ref. 44
used a standard laboratory centrifuge to spin down a grid of diamond
nanoneedles onto adherent cultured cells, followed by withdrawal and
diffusive entry of cargo from solution. With this strategy it was possible
to deliver a wide variety of cargo to primary neurons while maintaining
>80% viability. For nanoneedles of diameter about 300 nm and height of
about 4 µm, the required force for penetration was estimated to be 2 nN
per needle. These mechanistic studies emphasize the need for active force
to generate efficient membrane disruption. Thus far, the challenges of
these systems have included the difficulty of fabricating high-precision
structures and implementing the concept at scale.
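
As a back-of-envelope check on the figures quoted above, the sketch below converts the estimated 2 nN penetration force into an average contact pressure. It assumes, purely for illustration, that the force acts over a flat circular tip with the full 300 nm needle diameter; if the actual tip is sharper than the shaft, the local pressure would be correspondingly higher. The function name is hypothetical.

```python
import math


def tip_pressure_kpa(force_nn, tip_diameter_nm):
    """Average pressure (kPa) for a force applied over a flat circular tip."""
    if tip_diameter_nm <= 0:
        raise ValueError("tip diameter must be positive")
    radius_m = tip_diameter_nm * 1e-9 / 2.0
    area_m2 = math.pi * radius_m ** 2
    pressure_pa = force_nn * 1e-9 / area_m2
    return pressure_pa / 1000.0


if __name__ == "__main__":
    # 2 nN per needle and about 300 nm diameter, as quoted in the text.
    print(f"Average tip pressure: {tip_pressure_kpa(2.0, 300.0):.1f} kPa")
```

Because pressure scales with the inverse square of tip diameter, halving the effective contact diameter in this simple model would quadruple the pressure for the same force.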

Moving from solid to fluid (Fig. 4c), cone-plate viscometers, devices
for generating a determinable shear force over a surface, have also been
shown to transiently permeabilize apical membranes in cell monolayers.
Such observations inspired the design of microfluidic devices to expose
cells to shear forces proportional to flow velocity through
microchannels that taper from 300 µm to 50 µm in diameter. Although an
interesting concept, reproducibly controlling fluid shear forces at this
scale has generally proven difficult. One way around this is to employ
micrometre-sized cavitation bubbles. Ultrasound phenomena, also
known as sonoporation in the context of delivery, can produce cavitation
bubbles in solution and have been an intriguing permeabilization
approach since their introduction in the 1980s. However, a recent analysis
across multiple published data sets indicates that ultrasound consistently
struggles to deliver molecules with greater than 50% efficiency or 50%
viability in vitro. This was attributed to the operational mechanism
of random and violent cavitation being heterogeneous, with some cells
undergoing excessive damage while others remain unaffected. Targeted
cavitation is a more promising idea, whereby a precisely positioned
cavitation bubble is used to generate a local shear force at a given
stand-off distance from a target cell. Spatial control can be achieved by laser
excitation of an absorbent particle or substrate. Recently, this concept
was scaled up for deployment with cell monolayers. Substrates arrayed
with pores lined by metallic absorbers were irradiated underneath
adherent cells. Exploding bubbles were synchronized with active pumping to
successfully introduce large cargo, such as living bacteria greater than one
micrometre, into the cytoplasm of several cell types.

Bulk thermal insults are capable of perturbing the membrane for
delivery of small molecules but detrimental effects on cell function
preclude their implementation. Thermal disruption may be more
feasible if confined to a localized area (Fig. 4d). Indeed, gene transfection
has been demonstrated with cell solutions processed through
thermal inkjet printers, although it is unclear whether membrane
perturbation of passing cells arises from fluid shear effects at the nozzle,
temperature spikes, or both. A more precise approach is the use of
absorbent nanoparticles as nucleation sites for intense local heating.
Upon laser irradiation, local perturbation effects may be due to thermal,
chemical, or cavitation-mediated fluid shear phenomena. Recent
efforts indicate that tuning the laser pulse parameters and properties
of the absorbing particles can bias the mechanism towards a particular
mode. Although parameters are still being explored, a proof-of-
concept study demonstrated conditions under which irradiation of
gold nanoparticles was proposed to trigger localized thermal damage
that permitted more than half of the treated cells to take up labelled
antibodies.

As illustrated above, advances in micro- and nanotechnology are
breathing new life into delivery modes that were once deemed impractical
(Fig. 4). By concentrating precise membrane-perturbing effects to the
cellular and subcellular scale, the potential exists to address applications
that are underserved by current techniques. Early progress suggests that
such techniques may be among the first real challengers to electroporation’s
long-held dominance on membrane-disruption-based intracellular
delivery. Nanoneedles and microfluidics, for example, have shown
promising compatibility in stem cell reprogramming, functional
interrogation of primary immune cells, and transfection of cultured neurons.
To achieve further advances, delivery efficacy and safety must be combined
with scalability, tunable throughput, low cost and user-friendliness.

Figure 4 | Selected modes of bulk, microscale and nanoscale approaches for membrane-disruption-based intracellular delivery. Molecular cargo is
shown in orange.


Outlook 
Every day in research institutes and clinical centres around the world, 
scientists use kits and protocols based on viral vectors, lipid transfection 
agents, and electroporation, among other options. The complex
mechanisms of established methods and their often unpredictable impact on cell
behaviour have dramatically limited the scope of biological experiments
and reduced efficacy of potentially promising cell therapy concepts.
The biomedical research community would benefit greatly from a more
mechanistic and transparent understanding of intracellular delivery, both
to further the development of more robust techniques and to realize key
medical and industrial applications.
Effective and safe intracellular delivery systems facilitate progress in 
multiple fields from cell-based therapies, gene editing and cutting-edge
genomics to reprogramming cellular states and probing the intracellular
environment (Fig. 1). Demand for effective solutions currently outstrips
supply by a large margin, however. Nowhere is this shortfall more
keenly felt than in the treatment of primary and patient-derived cells,
including various types of immune cells, neurons and stem cells. Deep
interdisciplinary coordination will be required to transform imaginative
engineering solutions into technological platforms with biological
compatibility and relevance. Mechanistic studies must seek to gauge
delivery performance quantitatively, and to assess influx mechanisms,
cell damage and off-target perturbations associated with treatment.
More rigorous analysis of these factors will benefit the field, and counter
the temptation to ‘overhype’ technologies that are yet to prove their
utility in the clinic or laboratory. Issues related to scale-up, cost and
compatibility with current goods and manufacturing protocols must be
considered at an earlier stage of development. To this end, we anticipate 
that the guidelines in Tables 1 and 2 will assist researchers in selecting 
appropriate methods and help aspiring inventors to understand key 
requirements and areas of opportunity. 

As emerging intracellular delivery technologies take us beyond routine
nucleic acid transfection and enable robust manipulation of previously
recalcitrant cell types, we will be entering exciting new territory.
The ability to deliver proteins, peptides, nanomaterials, molecular tags,
and a variety of other compounds will enable unprecedented flexibility
in our capacity to manipulate cell function and probe the intracellular
environment. The systematic deployment of transcription factors to alter
gene expression, or antibodies to label or block intracellular processes,
could be revolutionary. With next-generation intracellular delivery, the
development of molecular probes and nanomaterial sensors with which
to analyse intracellular properties could facilitate both discovery and
diagnostics with unprecedented accuracy. Future approaches could
focus on targeting specific subcellular compartments, developing
combinatorial strategies to direct materials to the endoplasmic reticulum,
nucleus or mitochondria, for example.

Advances in nanotechnology and microfluidics will continue to expand
the frontiers of membrane-disruption-based delivery. Already, platforms
harnessing the power of exploding bubbles, microfluidic squeezing, and
nanoneedles have been transformed into commercial ventures. On the
other hand, carrier-based technologies continue to develop, with new
generations of viral vectors, endosome disruption strategies,
stimuli-sensitive functional materials, and biomimetic inspiration from
pathogenic mechanisms and membrane-trafficking processes. Progress in
these fields will lead to new challenges, such as off-target effects and
innate cellular responses against carrier and cargo alike. Solving this
next generation of problems may hinge on our ability to understand
current delivery mechanisms and to implement the analytical approaches
necessary to characterize cellular responses. Despite the barriers that
remain, we anticipate that next-generation technologies will translate
beyond academic endeavours into portable, personalized, cell-based
diagnostics and the use of clinical intracellular delivery to engineer cell
fate for therapeutic benefit.

The 2013-2016 epidemic of Ebola virus disease in West Africa was of unprecedented magnitude and changed our
perspective on this lethal but sporadically emerging virus. This outbreak also marked the beginning of large-scale real- 
time molecular epidemiology. Here, we show how evolutionary analyses of Ebola virus genome sequences provided 
key insights into virus origins, evolution and spread during the epidemic. We provide basic scientists, epidemiologists, 
medical practitioners and other outbreak responders with an enhanced understanding of the utility and limitations of 
pathogen genomic sequencing. This will be crucially important in our attempts to track and control future infectious 
disease outbreaks. 

Figure 1 | Evolution of EBOV during the 2013-2016 outbreak showing
the extent and location of virus sampling. a, Sampling during the
2013-2016 EVD epidemic. Sequencing efforts closely match confirmed
and suspected case numbers in each administrative division of Guinea
(green), Liberia (red) and Sierra Leone (blue) (Spearman correlation
coefficient = 0.91). b, Map of the three countries most affected by EVD
during the 2013-2016 EVD epidemic. Administrative divisions in Guinea,
Liberia and Sierra Leone are shown in green, red and blue, respectively,
and coloured according to the cumulative numbers of confirmed and
suspected cases throughout the epidemic. Hatched areas indicate divisions
that never reported any cases. The boundary data for the maps is from
GADM (http://www.gadm.org). c, Temporal phylogeny of all publicly
available EBOV genomes estimated using BEAST. Three lineages


data set, the widespread spatial coverage (Fig. 1a), and the contemporary 
nature of the EBOV data provide the first in-depth genomic anatomy of 
an epidemic, setting a benchmark for future outbreak responses. Here, we 
describe how pathogen sequences produced during the 2013-2016 EVD 
epidemic provided key insights into EBOV genomic epidemiology and 
molecular evolution, and note the lessons that need to be learned for the 
effective study of future outbreaks. 


Ebola virus disease in humans 
Ebola virus (species Zaire ebolavirus) is one of four viruses—with 
Sudan virus, Tai Forest virus, and Bundibugyo virus—within the genus 
Ebolavirus that cause severe disease in humans and other primates. The 
final member of the genus is Reston virus, although infection with this 
virus does not appear to cause human disease. All ebolaviruses are
members of the family Filoviridae, which also includes Lloviu cuevavirus
(genus Cuevavirus) and the severe human pathogen Marburg virus (genus
Marburgvirus). It is believed that bats serve as the primary reservoir for
EBOV. However, EBOV infections have been confirmed in only a
small number of mammalian species and it is unclear whether the virus
infects a wider range of animal hosts that have yet to be sampled. Some
evidence for a broader host distribution, at least in the evolutionary past,
comes from the observation that endogenous filoviruses are present in
the genomes of diverse mammalian species, including marsupials.
EBOV in humans was first described in Zaire (now the Democratic
Republic of the Congo (DRC)) in 1976, where, over a two-month period,
it led to an outbreak of 318 cases with an 88% case-fatality rate (CFR).
CFRs, however, are difficult to estimate for EVD, so such numbers
identified in previous studies are marked with coloured backgrounds.
The sequence alignment was partitioned into four categories: codon
positions 1, 2 and 3, and non-coding intergenic regions. Changes in each 
of the four partitions were modelled according to the HKY +1 nucleotide 
substitution model with relative rates between partitions. Tip dates
were used to calibrate a relaxed molecular clock with rates drawn from
a lognormal distribution with an uninformative prior placed on the
mean of the distribution. A flexible ‘skygrid’ tree prior was used to allow
for changes in effective population sizes over time. Each tip is coloured
according to the country where the patient was most likely to have been 
infected: green for Guinea, red for Liberia and blue for Sierra Leone. 

Data correct as of 19 April, 2016. Nature remains neutral with regard to 
jurisdictional claims in published maps. 


should be interpreted with caution. Between 1977 and 2014, 12 smaller
outbreaks were reported in Middle Africa, with 32-315 cases and CFRs
ranging from 47% to 89%. The 2013-2016 EVD epidemic is therefore
notable not only for its duration and magnitude, but also as the first
outbreak in West Africa and the first in which case exportations and
nosocomial transmissions were reported outside of Africa. However, despite
the scale of the 2013-2016 EVD epidemic, infection with EBOV Makona
appears to lead to similar disease characteristics and transmission profiles
as previous EBOV outbreak variants. For example, the CFR for the
2013-2016 EVD epidemic appears to be around 70% and estimates
for the basic reproduction number (R0) fall between 1.5 and 2.5, both
of which are comparable to calculations from previous outbreaks.
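
As a rough illustration of why a single CFR figure deserves caution, the sketch below computes the naive CFR for the 1976 outbreak together with a Wilson score interval. The death count of 280 is an assumption (88% of 318 cases, rounded), and the function name is hypothetical. The interval reflects binomial sampling error only; unresolved outcomes and under-ascertainment, which are not modelled here, can bias the estimate further.

```python
import math


def case_fatality_rate(deaths, cases, z=1.96):
    """Naive CFR (deaths / cases) with a Wilson score interval.

    z=1.96 gives an approximate 95% interval. Only binomial sampling
    error is covered; reporting biases are not modelled.
    """
    if cases <= 0 or not 0 <= deaths <= cases:
        raise ValueError("need 0 <= deaths <= cases and cases > 0")
    p = deaths / cases
    denom = 1 + z ** 2 / cases
    centre = (p + z ** 2 / (2 * cases)) / denom
    half_width = z * math.sqrt(p * (1 - p) / cases + z ** 2 / (4 * cases ** 2)) / denom
    return p, centre - half_width, centre + half_width


# Assumed counts for the 1976 outbreak: 318 cases, ~88% fatal -> 280 deaths.
cfr, low, high = case_fatality_rate(deaths=280, cases=318)
print(f"CFR = {cfr:.1%} (approx. 95% interval {low:.1%} to {high:.1%})")
```

Even with several hundred cases the interval spans several percentage points, so differences of a few points between outbreaks are hard to interpret.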


Origin of the 2013-2016 Ebola virus disease epidemic 

Evolutionary analyses of genome sequences from the 2013-2016 EVD 
epidemic have provided a clear picture of the origin and spread of EBOV 
Makona. One of the most important early questions was
whether the epidemic was the result of a single cross-species transmission 
event into humans, or whether there were repeated zoonotic events from a 
widespread animal EBOV reservoir. Owing to the high genetic similarity 
of virus genomes sampled from the beginning of the epidemic, a single 
spill-over infection seems the more likely. Phylogenetic analyses also
make it clear that once the outbreak was established, later lineages of
EBOV Makona had descended from those circulating earlier in the
epidemic (Fig. 1c). This is in contrast to some of the earlier EVD
outbreaks, in which epidemiological and sequence-based investigations
have provided evidence for multiple spill-over infections.
Sequence-based findings are consistent with epidemiological investigations
into the timing of the 2013-2016 EVD epidemic, which placed the
first case around late December 2013 in Guinea. In agreement with this,
molecular clock dating analyses suggest that the common ancestor of all
sequenced EBOV Makona lineages be placed at the beginning of 2014,
with lineages in Guinea falling close to the root of the tree (Fig. 1c).
These studies also showed that EBOV Makona diverged from other EBOV
outbreak variants only about a decade ago. This finding suggests that
EBOV Makona may be fairly new to West Africa, sharing recent common 
ancestry with Middle African variants that are found thousands of miles 
away. Molecular clock dating analyses have also shown that all recorded 
human EVD outbreaks caused by EBOV appear to share a common
ancestor around 1975. Notably, this is around the time of the first described
EVD outbreak in 1976, suggesting that the EBOV lineage experienced a
severe genetic bottleneck before the first human outbreak. Despite
their power, molecular clock dating studies of this type would undoubtedly
benefit from additional EBOV genomic sequence data from both
previous EVD outbreaks and animal reservoir populations. 


Genetic diversification of Ebola virus Makona 

Because of the relatively small magnitude and duration of previous EVD 
outbreaks, earlier EBOV sequencing efforts were necessarily limited to 
small numbers of temporally clustered cases. The data from these earlier 
studies largely comprised single viral lineages and led to the perception 
that EBOV genomes remain stable over the course of an outbreak. 
However, the much larger size and duration of the 2013-2016 EVD 
epidemic (Fig. 1a, b) resulted in a different molecular epidemiological 
pattern for EBOV Makona, in which multiple virus lineages arose and 
co-circulated (Fig. 1c). 

Despite their shared border, the EBOV Makona genomes sampled from 
the three most affected countries, Guinea, Sierra Leone and Liberia, gen- 
erally (although not exclusively) form separate clusters on phylogenetic 
trees and exhibit different phylogenetic patterns (Fig. 1b, c).
Genomic studies have shown that the 2013-2016 EVD epidemic was
dominated by three major lineages, denoted A (refs 21, 25), SL2 (ref. 2)
and SL3 (refs 2, 3) (Fig. 1c). Most of these lineages (including lineage
A in Guinea, SL3 in Sierra Leone and Liberian isolates)
circulated locally, with only sporadic cross-border transmission (Fig. 1c).
By contrast, lineage SL2 (ref. 2) was the most widespread in the
epidemic (Fig. 1c). This lineage probably arose in Sierra Leone,
where it gave rise to lineage SL3 and several sub-lineages. It crossed
more than twice into Liberia, seeded several transmission chains in
Guinea and spread throughout Sierra Leone (Fig. 1c). It is unclear
whether any of these lineages carry mutations that could have affected
their epidemic potential, or, perhaps more likely, whether the increased
geographical spread of SL2 and SL3 is a reflection of chance epidemiological
founding events (see below).


Evolutionary dynamics of Ebola virus Makona 
Although the origin and spread of the 2013-2016 EVD epidemic seem 
well resolved, other aspects of EBOV evolution during this
epidemic have proven more controversial. A major point of contention
in both scientific publications and the popular press has
been whether the virus ‘mutated’ unusually rapidly during this outbreak.
Unfortunately, much of this discussion is based on misrepresentations of
what type of rate was measured and how these rates can be translated into
predictions of phenotypic evolution. 

The starting point for the debate over how quickly EBOV Makona
evolved was the observation by Gire et al. that the mean evolutionary
rate early in the epidemic was ~1.9 × 10⁻³ (95% Bayesian credible
interval: 1.11, 2.91 × 10⁻³) nucleotide substitutions (subs) per site per
year. This rate was approximately twice as high as that averaged from
genomic sequences of EBOV variants sampled from multiple outbreaks,
at around 0.9 × 10⁻³ (0.81, 1.18 × 10⁻³) subs per site per year (ref. 2).
EBOV outbreak variants are viral lineages responsible for human
outbreaks. Other EBOV variants include EBOV Yambuku (Mayinga) from
Figure 2 | Evolutionary rates of EBOV compared to those of other RNA
viruses. a, Estimates of evolutionary rate in diverse RNA viruses. Green 
points at the top indicate the mean evolutionary rates estimated for EBOV 
during the 2013-2016 EVD epidemic from different studies, with solid 
lines showing the 95% credible intervals derived from BEAST analyses. 
Points at the bottom represent equivalent estimates (without uncertainty 
intervals) published previously for negative-sense single-strand RNA 
viruses (red), positive-sense single-strand RNA viruses (blue) and double- 
strand RNA viruses (purple). Points with the same shade belong to the
same family. Evolutionary rate estimates for EBOV Makona occupy a 
narrow distribution within the range of rates observed in RNA viruses as 
a whole. b, 95% credible intervals for the distribution of evolutionary rates 
for EBOV from the 2013-2016 EVD epidemic published previously. *An 
erratum revised the mean evolutionary rate estimate for ref. 17 (Hoenen
et al.) to 1.32 × 10⁻³ (95% credible intervals: 0.89, 1.75 × 10⁻³) subs per
site per year. 


1976, EBOV Kikwit from 1995, and EBOV Lomela from 2014 (DRC). 
The between-outbreak evolutionary rate therefore reflects estimates
averaged across all EBOV variants. However, later studies of EBOV Makona
consistently produced lower rate estimates than those generated by Gire
et al. Indeed, taking the publicly available sequence data as a
whole, estimates of the EBOV evolutionary rate for the 2013-2016
epidemic converge on a mean value of around 1.2 × 10⁻³ (1.13, 1.27 × 10⁻³)
subs per site per year (Fig. 2). The ensuing discussions of whether EBOV
is evolving more or less rapidly than expected, and what this means for
the ability of the virus to evolve changes in transmissibility and
virulence, have become a common narrative.
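
To put these per-site rates on a more tangible scale, the following sketch converts them into expected substitutions per genome per year. The genome length of 18,959 nucleotides is an assumption (the approximate length of an EBOV reference genome, not a figure given in the text), and the function name is hypothetical.

```python
# Assumed genome length (nt) of an EBOV reference genome; not from the text.
GENOME_LENGTH_NT = 18_959


def subs_per_genome(rate_per_site_per_year, years=1.0, genome_length=GENOME_LENGTH_NT):
    """Expected substitutions across the whole genome over `years`."""
    return rate_per_site_per_year * genome_length * years


# Mean rates quoted in the text (subs per site per year).
rates = {
    "early 2013-2016 epidemic (Gire et al.)": 1.9e-3,
    "2013-2016 epidemic, pooled data": 1.2e-3,
    "between outbreaks": 0.9e-3,
}

for label, rate in rates.items():
    print(f"{label}: ~{subs_per_genome(rate):.0f} substitutions per genome per year")
```

Under this assumption the twofold spread in rate estimates corresponds to roughly 17 versus 36 substitutions per genome per year, a difference in counts that says nothing by itself about phenotype.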

The debate over the evolutionary dynamics of EBOV highlights a number
of general issues in viral evolution. First, estimates of evolutionary rates
are generally expected to be higher within outbreaks than between them.
This is because the relatively short timescale over which sequences
are sampled during outbreaks may be insufficient for mutations to be
removed (or make them less likely to be fixed) by either natural selection
or genetic drift. Hence, pathogen genomic sequences sampled early in
epidemics will contain an excess of mildly deleterious variants that would
eventually be eliminated by purifying selection. This will tend to inflate
evolutionary rates and in part explains why evolutionary rates in RNA
viruses are often ‘time-dependent’: high towards the present, low towards
the past. Indeed, it is notable that as the 2013-2016 EVD epidemic
progressed, analyses of evolutionary rate in EBOV converged on a reliable
estimate (Fig. 2), which is expected owing to the increasing size of the
data set combined with a longer sampling period. When viewed in the
context of viruses as a whole, it is also striking that all the evolutionary
rate estimates for EBOV fall in a narrow range towards the centre of a 
distribution that spans more than three orders of magnitude, from about 
10⁻⁵ to about 10⁻² subs per site per year (Fig. 2).

As well as time-dependence, it is possible that purifying selection 
may be relaxed in humans following cross-species transmission and/or 
that EBOV may undergo more replications per unit time during human 
outbreaks than in its reservoir species. Both of these scenarios would
increase the within-outbreak rate. Potential evidence for fundamental 
differences in evolutionary dynamics associated with species-jumping is 
provided by the EBOV Lomela variant that emerged in the DRC during 
2014, causing a small EVD outbreak with 69 cases. The branch length on
the EBOV phylogenetic tree leading to the EBOV Lomela sequences from 
their common ancestor is far shorter than expected from their sampling 
time in 2014 (Fig. 3a), indicating a markedly lower evolutionary rate.

Figure 3 | Examples of violations of the Ebola virus molecular clock.
a, Root-to-tip regression of genetic distances against time (month and
year) of sampling for 105 representative EBOV variant sequences collected
between 1976 and 2016 based on a maximum likelihood tree. b, Equivalent
root-to-tip regression of publicly available sequences from the 2013-2016
EVD epidemic using data on the day of sampling, and the maximum
likelihood tree on which the estimates were made. RAxML (ref. 99)
(panel a) and PhyML (ref. 100) (panel b) were used to estimate the
maximum likelihood phylogenies under an HKY+Γ substitution model
that were rooted via least squares regression in TempEst. Substitutions
accumulate linearly with time, with some variation. Sequences recovered
from transmission events that occurred as a result of persistent EBOV
infection often exhibit temporal anomalies. In this scenario, EBOV may
accumulate substitutions at a lower rate during persistence in individuals
compared to regular person-to-person transmission. Larger red points
indicate sequences of EBOV sampled from EVD survivor-associated
transmission chains. Scale bar, nucleotide substitutions per site.


This could reflect an evolutionary history in a different reservoir host
from those previously described for EBOV, in which replication rates and
hence evolutionary rates are reduced, or in which purifying selection acts
with greater potency than in humans. Lower EBOV evolutionary rates
were also observed in suspected cases of transmission from human EVD
survivors during the 2013-2016 epidemic (Fig. 3b). Unexpectedly low
evolutionary rates may therefore serve as an important signal for detecting
probable transmissions from EVD survivors during flare-ups (Fig. 3b).
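
The idea of using an unexpectedly low rate as a signal can be sketched with a plain root-to-tip regression. This is a minimal illustration, not the TempEst procedure: it assumes the tree is already rooted and that root-to-tip distances have been extracted, and all tip dates, distances and function names below are hypothetical.

```python
def fit_clock(dates, distances):
    """Least-squares fit of root-to-tip distance against sampling date.

    Returns (rate, intercept, origin), where origin is the date at which
    the fitted line crosses zero distance (a rough root-age estimate).
    """
    n = len(dates)
    if n < 2 or n != len(distances):
        raise ValueError("need two or more (date, distance) pairs")
    mean_t = sum(dates) / n
    mean_d = sum(distances) / n
    sxx = sum((t - mean_t) ** 2 for t in dates)
    if sxx == 0:
        raise ValueError("all sampling dates are identical")
    sxy = sum((t - mean_t) * (d - mean_d) for t, d in zip(dates, distances))
    rate = sxy / sxx
    intercept = mean_d - rate * mean_t
    origin = -intercept / rate if rate != 0 else float("nan")
    return rate, intercept, origin


def clock_residual(date, distance, rate, intercept):
    """Observed minus expected distance; negative means fewer changes than expected."""
    return distance - (intercept + rate * date)


# Hypothetical tips: decimal sampling dates and root-to-tip distances (subs per site).
TRUE_RATE, ROOT_DATE = 1.2e-3, 2013.9
dates = [2014.2, 2014.5, 2014.8, 2015.1, 2015.4]
distances = [TRUE_RATE * (t - ROOT_DATE) for t in dates]

rate, intercept, origin = fit_clock(dates, distances)

# A hypothetical survivor-associated tip sampled in 2015.9 whose genome
# stopped accumulating changes around 2015.1.
survivor_residual = clock_residual(
    2015.9, TRUE_RATE * (2015.1 - ROOT_DATE), rate, intercept
)
print(f"rate = {rate:.2e} subs/site/year, origin = {origin:.1f}")
print(f"survivor tip residual = {survivor_residual:.2e} subs/site")
```

A strongly negative residual marks a tip that carries fewer substitutions than its sampling date predicts, which is the pattern described above for survivor-associated transmission; real data would also need an allowance for ordinary scatter around the regression line.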

The debate over EBOV evolutionary rate estimates has also revealed
confusion over the terminologies used to describe the rate at which 
genetic changes accumulate. The most straight-forward measure of the 
rate of molecular evolution is the nucleotide substitution rate. This rate 
describes the frequency with which mutations are fixed in populations 
through time and for EBOV is best approximated by the rate observed 
between outbreaks (Box 1). This rate reflects the long-term evolutionary
processes including selective constraints on the genome, host-species- 
specific adaptation and the cumulative results of genetic drift. In contrast, 
the rate of change within outbreaks might be better thought of as the 
evolutionary rate, as the short timescale of sampling necessarily means 
that not all mutations observed will be fixed. Both the substitution rate 
and evolutionary rate can be clearly distinguished from the mutation rate. 
This term relates to the rate at which mutations are generated during viral
replication by intrinsic biochemical factors, and in particular to how
frequently the viral polymerase makes errors (Box 1, Box 1 Figure). This
rate is generally challenging to measure and is unknown for most
viruses, including EBOV. It is therefore unfortunate that the debate over
EBOV evolution has focused on ‘mutation’ (and hence potential differences
intrinsic to particular virus lineages) when this is not the parameter
that has been measured.

Finally, it is too simplistic to think that a twofold variation in rate
estimates for EBOV will result in radically different evolutionary behaviour,
especially when seen in the context of RNA viruses as a whole (Fig. 2).
The likelihood of meaningful adaptive evolution depends not only on
the rate at which the virus is able to generate mutations, but also on those
environmental and host factors that shape the selection pressures acting
on the virus. That filoviruses have infected a wide range of mammalian
hosts suggests that they are readily able to adapt to new
environments irrespective of potential differences in evolutionary rate.


Phenotypic evolution of Ebola virus Makona 

While the patterns of EBOV molecular evolution during the 2013-2016 
EVD epidemic have been well characterized, it is not currently known 
whether any of the observed mutations have resulted in differences in 
viral phenotype. This is particularly the case with respect to such traits as 
antigenicity, transmissibility and virulence, or mutations that could have 
an impact on vaccines, therapeutics and diagnostics. Although genomic 
sequence data play a central role in understanding outbreak dynamics and 
evolution, revealing key aspects of viral phenotype using sequence data 
alone is fraught with difficulties, and may even be counterproductive to 
outbreak response by steering the focus away from more critical needs. 

As the 2013-2016 West African epidemic of EVD was so much larger
than previous outbreaks, it is possible that EBOV Makona possessed 
mutations that enhanced its transmissibility in humans. Without direct 
experimental data, however, a simpler scenario is that the scale and 
severity of the 2013-2016 EVD epidemic reflects a different epidemi- 
ological context than previous outbreaks. Under this model, most, if 
not all, EBOV variants entering human populations after cross-species 
transmission have the ability to cause major epidemics, but have been 
unable to do so because of a lack of a susceptible host population and/or 
environment. In particular, previous EVD outbreaks occurred in largely 
isolated and rural areas (with the notable exception of the 1995
outbreak in Kikwit, which has a population of ~400,000; ref. 41), where
there were either an insufficient number of susceptible people to guar- 
antee long-term transmission, or the outbreak was quickly controlled 
by efficient interventions. The 2013-2016 EVD epidemic, in contrast, 
was the first in West Africa and the first in which a large EVD epidemic 
resulted in sustained community transmission from rural settings to 
major urban centres, where it was easier to establish large-scale
transmission networks. This included the establishment of ‘underground’
networks, amplified by reluctance to seek medical advice in the affected
communities, which greatly hindered intervention strategies focused on
breaking chains of transmission. That the scale of the 2013-2016 EVD
epidemic reflects virus epidemiology more than virus evolution
is also supported by the failure to find evidence for heritable changes
in the duration of virus shedding or virulence during the course of the
2013-2016 EVD epidemic.

However, it was also the case that EBOV evolution during the 2013- 
2016 EVD epidemic was characterized by an abundance of changes in 
the nucleotide and amino acid sequences that could fuel adaptation 
for more efficient human transmission; any mutations that increased 
R0 would have been favoured by natural selection. Because of its key
role in virus-host interactions, most attention has been directed
towards the EBOV glycoprotein, and it is notable that the highest
level of genetic amino acid diversity generated during the 2013-2016
EVD epidemic occurred in the glycoprotein (in particular, its
mucin-like domain). For example, we observed 104 amino acid changes
in glycoproteins that were shared by at least two EBOV Makona 
lineages from the 1,500 available EBOV genomes that make up 5% 

BOX 1
Different measures of genome sequence change


Mutation rate 

As viruses replicate, mutational errors are incorporated into the viral genome. The mutation rate is therefore typically expressed as the number 
of mutations per site, per replication event. The mutation rate for RNA viruses such as EBOV is largely determined by the viral RNA-dependent 
RNA polymerase, which lacks proof-reading activity. The estimation of mutation rates requires complex sequencing-based or phenotypic marker 
experiments that correct for the impact of natural selection. Mutation rates are unknown for most viruses and have not been determined for
EBOV, although they would be predicted to be comparable to those of other (-)ssRNA viruses and are likely to be similar across all EBOV outbreak variants.


Evolutionary rate 

The evolutionary rate of a virus can be defined as the observed rate at which new variants arise and spread in the viral population. This can 
be measured by methods that compare the genetic change in viral genomes collected at different times. Importantly, evolutionary rates in 
RNA viruses may be dependent on the timescale over which they are measured: they are elevated in the short-term, such as within disease 
outbreaks, because mildly deleterious mutations may not have been eliminated by purifying selection.


Substitution rate 

The substitution rate is best described as the long-term rate at which genetic variants become fixed in a virus lineage over evolutionary time- 
scales, such as between human outbreaks in the case of EBOV. Hence, this rate reflects the complex interplay of natural selection, genetic drift, 
modes of transmission and epidemiological processes. This rate will also usually be lower than the short-term evolutionary rate because many 
of the variants circulating within outbreaks and epidemics will ultimately be eliminated. Furthermore, saturation—repeated changes at the same 
site—will further reduce the measured substitution rate. 


Fixation rate 

An added complexity in estimating rates in RNA viruses is that the population genetic concept of ‘fixation’, central to the definition of 
substitution, is ill-defined. In slowly evolving organisms, fixation events can usually be distinguished from polymorphisms by analysing 
individual nucleotide sites within and between species. However, in rapidly evolving RNA viruses, fixation can be described to occur (1) at the 
level of individual hosts over the course of infection, (2) in viral lineages within specific geographic locations or epidemiological networks 
(such as the different lineages of EBOV generated during the 2013-2016 EVD epidemic), (3) in global meta-populations, and (4) between
different viral species. 


Observed variation 


Box 1 Figure | Illustration of different measures of genomic variation. Mutations,
accumulate over time. This phenomenon is at the core of molecular 
clocks, a class of methods that aim to convert molecular phylogenies 
with branch lengths given in expected substitutions per site into 
plausible temporal phylogenies in which branch lengths are given 

in time units and the trees themselves are embedded in time. By 
making use of sequences sampled at different times, such methods 
can estimate the evolutionary rate that provides the conversion from 
genetic distance into time. As phylogenetic methods have become 
ever more powerful and easily accessible, confusion has resulted from 
the frequent and interchangeable use of the terms mutation rate and 
substitution rate to signify the ‘molecular clock’ rate. Mutation and 

of the more than 28,000 reported EVD cases. While it is not known
whether any of these amino acid changes led to functional differences,
one plausibly important glycoprotein variant that originated early in
the epidemic (in lineage SL2) is an alanine to valine change at
residue 82 (A82V). This is the first substitution observed in the receptor
binding domain of EBOV and could potentially alter the interaction
between the EBOV glycoprotein and its host receptor Niemann-Pick C1
(NPC1). Clearly, determining whether lineages of EBOV Makona


carrying A82V or other mutations that arose during the 2013-2016 
EVD epidemic differ in epidemic potential should be a research 
priority. 

Irrespective of potential differences in transmissibility that are yet to 
be uncovered, it is more certain that EBOV Makona is no different from 
previous EBOV outbreak variants when it comes to bodily fluids being 
the primary route of transmission. Early on in the 2013-2016 EVD
epidemic there was high-profile speculation that EBOV could evolve
respiratory (that is, airborne) transmission due to genetic diversity in the
viral population. However, there is no evidence for airborne EBOV
transmission during the 2013-2016 EVD epidemic, or any other EVD
outbreak, nor are there any examples of other viruses evolving a new
mode of transmission on the timescale of individual outbreaks. Although
influenza virus shifts its mode of transmission from (primarily) faecal-oral
in its wild bird reservoir to respiratory in humans, this change occurs at
the point of cross-species transmission and not during human outbreaks. 

While the occurrence of airborne transmission can be eliminated for 
EBOV, studies using genomic sequence data have conclusively shown
that sexual transmission plays a previously unappreciated role in EBOV
dissemination and reignition. However, the long-term epidemiological
and evolutionary implications of this mode of transmission are
unclear and warrant further in-depth studies. 


Public health implications of genomic epidemiology 
In addition to providing essential information on the pattern and dynam- 
ics of viral evolution during epidemics, viral genomic data may be of more 
direct public health importance. Indeed, the 2013-2016 EVD epidemic is 
arguably the first in which genomic data have been used directly in a
real-time public health setting, to inform policies and infection control.
That some of these studies were undertaken under difficult field conditions
highlights the potential for portable genomic sequencing to
transform outbreak responses.

The simplest use of genomic data during outbreaks has been to reveal
the pathways of viral spread through communities; when combined
with phylogeographic approaches, the results can be used to direct
intervention methods to transmission hot-spots and to determine the
impact of specific interventions such as border closures. For example,
Tong and colleagues used viral genome sequencing to show how EBOV
spread from the capital city of Freetown to multiple districts throughout
Sierra Leone, with Arias et al. later documenting how virus traffic from
Freetown established new transmission clusters late in the epidemic.
Similarly, phylogenetic analyses revealed the co-circulation of multiple
EBOV lineages within individual localities such as Conakry, as well as
cross-border virus traffic between Guinea and Sierra Leone, highlighting
important gaps in intervention. On a more localized epidemiological
scale, genome sequence data provide a way to reveal who infected
whom in EBOV transmission networks (although see below). Pathogen
sequence data can therefore yield key information on the likelihood of,
for example, sexual transmission, as well as the possible transmission
of EBOV via breast milk. A similarly precise reconstruction of
transmission chains is essential in understanding the multiple reignition
events that occurred during the EBOV epidemic and their relation to viral
transmissions from EVD survivors (Fig. 3b). It is unclear whether the
small subset of EVD survivors that harbour persistent infections pose a
sustained infection risk or whether an episode of renewed viral replication
is required for transmission to occur. Considering the pattern and degree
of EBOV genetic change within such cases may provide critical insights.
Phylogenetic approaches also provide a powerful way to accurately
estimate various outbreak parameters, such as R0, including those for
individual virus lineages, which are slow and difficult to obtain using
longitudinal case data. Finally, virus ‘super spreaders’ within human
populations can also be readily identified using pathogen sequence data.

Despite the quantity and quality of the viral genome sequence data gen- 
erated during the 2013-2016 EVD epidemic, there are limitations to the 
scope and impact of genomic epidemiology. Clearly, the direct phenotypic 
effects of individual mutations on vaccines, therapeutics and diagnostics 
need to be tested experimentally. However, should viral lineages that differ
in such properties arise during outbreaks, evolutionary genomic analyses
can provide a powerful means to both determine their origins and rapidly 
track their spread through human populations. 


Lessons learned and future directions 

The 2013-2016 EVD epidemic has set the benchmark for the use of
large-scale molecular epidemiology as an essential tool in outbreak response.
Given the development of portable sequencing technologies, real-time
viral genome sequencing is now possible in clinics and diagnostic
laboratories, including in resource-limited settings. This will offer
critical data to inform epidemiological intervention, but will require a
willingness to invest in scientific infrastructure, healthcare and training
of local staff in the affected countries. The need for immediate analysis
and the growth of open sharing of sequence data mean the challenge
in genomic studies may be moving from data acquisition to analysis
and interpretation. However, it is also the case that in-country real-time
sequencing was not established until relatively late in the West African
epidemic, when case numbers had already begun to decline. In
addition, many of the genome sequences were obtained in the absence
of strong clinical and epidemiological metadata, such as the precise
geographical location from where the sample was obtained, whether the
individual survived the infection, and the time to the onset of symptoms. 
While it may be difficult to obtain such data during a rapidly develop- 
ing outbreak, this limits the usefulness of genomic sequencing data in 
addressing a number of central biological questions, such as the viro- 
logical basis to any variation in disease presentation and the evolution of 
pathogen virulence. An important lesson for the study and management 
of future disease outbreaks is not only that portable sequencing platforms 
should be deployed as rapidly as possible, but that each sequence obtained 
should be linked to as much relevant metadata as is ethically and tech- 
nically possible. 

Despite the insights provided by the analysis of EBOV genome data, 
it is also clear that major questions remain. For future outbreaks it
will be important to resolve exact chains of transmission (that is, who 
infected whom), as this provides vital information on the patterns and 
mechanisms of virus spread within single communities and hospitals, 
which will help target interventions. Sequence data from the 2013-2016 
EVD epidemic indicated that these chains were difficult to infer using 
the population consensus sequences from individual hosts, although 
in several cases they were shown to be in agreement with epidemiological
studies. Hence, although EBOV evolves rapidly, mutations
are not necessarily fixed at the scale of individual transmission events, 
which limits phylogenetic resolution. One solution is to examine the 
transmission patterns of intra-host single nucleotide variants (iSNVs) 
(ref. 3). If multiple iSNVs are routinely transmitted between individuals
(that is, if EBOV is not subject to a severe population bottleneck
at inter-host transmission) then tracking the inheritance patterns of
these variants can provide information on transmission patterns
between individual hosts, as previously shown for influenza
virus. Importantly, it has been shown that substantial intra-host
variation can be observed for EBOV, with 2-5 iSNVs per infected
patient being typical when using a minor allele frequency cutoff of
5%. As the cutoff is lowered, the numbers of observed iSNVs
increase sharply.
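
The dependence of iSNV counts on the frequency cutoff can be illustrated with a small sketch. It assumes per-site base counts are already available from read alignments; the positions, counts, the `min_depth` filter and the function name are all hypothetical, and real pipelines would also need to account for sequencing error and strand bias.

```python
def call_isnvs(site_counts, min_maf=0.05, min_depth=100):
    """Return sites whose second most common base reaches `min_maf`.

    site_counts maps genome position -> {base: read count}.
    The result maps position -> (major base, minor base, minor allele frequency).
    """
    isnvs = {}
    for pos, counts in site_counts.items():
        depth = sum(counts.values())
        if depth < min_depth:
            continue  # too few reads to estimate a frequency
        ranked = sorted(counts.items(), key=lambda kv: kv[1], reverse=True)
        if len(ranked) < 2 or ranked[1][1] == 0:
            continue  # no minor allele observed at this site
        minor_base, minor_count = ranked[1]
        maf = minor_count / depth
        if maf >= min_maf:
            isnvs[pos] = (ranked[0][0], minor_base, maf)
    return isnvs


# Hypothetical base counts for one patient sample.
sample = {
    1000: {"A": 980, "G": 20},   # minor allele at 2%
    2500: {"C": 900, "T": 100},  # 10%
    7000: {"G": 940, "A": 60},   # 6%
    9000: {"T": 995, "C": 5},    # 0.5%
    12000: {"A": 1000},          # no variation
    15000: {"C": 40, "T": 10},   # depth below min_depth, skipped
}

for cutoff in (0.05, 0.01, 0.001):
    print(f"cutoff {cutoff:.1%}: {len(call_isnvs(sample, min_maf=cutoff))} iSNVs")
```

Lowering the cutoff admits progressively rarer variants, but it also admits more sequencing errors, so the choice of threshold trades sensitivity against false positives.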

Genomic studies undertaken in West Africa towards the end of the 
2013-2016 EVD outbreak illustrated how iSNV data can help to resolve 
EBOV transmission pathways. For example, Arias and colleagues showed
how the analysis of iSNVs from EBOV patients in Sierra Leone could
provide strong support for sexual transmission from EVD survivors.
Determining the number of iSNVs that transmit between hosts can also
provide key information on the severity of the transmission bottleneck,
itself critical for understanding the ability of natural selection to shape 
patterns of genetic diversity. 

Finally, while large-scale EBOV sequence studies have now been under- 
taken in human populations, there is an evident and critical need to deter- 
mine the ecology and evolution of EBOV in its animal reservoir(s). While 
most current data points to bats being the ultimate reservoir host,
long-term studies of EBOV in bats have yet to be performed and it is 
likely that other host species exist, which may have a major bearing on 
epidemiological dynamics. To truly understand the ecology and evolution 
of EBOV, as well as its mechanisms of pathogenicity, will require infor- 
mation on the virus in all its host-virus interactions, and not just those 
associated with EVD outbreaks in humans. 

The Simons Genome Diversity Project:
300 genomes from 142 diverse populations 


A list of authors and affiliations appears at the end of the paper. 


Here we report the Simons Genome Diversity Project data set: high quality genomes from 300 individuals from 142 
diverse populations. These genomes include at least 5.8 million base pairs that are not present in the human reference 
genome. Our analysis reveals key features of the landscape of human genome variation, including that the rate 
of accumulation of mutations has accelerated by about 5% in non-Africans compared to Africans since divergence. 
We show that the ancestors of some pairs of present-day human populations were substantially separated by 100,000 
years ago, well before the archaeologically attested onset of behavioural modernity. We also demonstrate that indigenous 
Australians, New Guineans and Andamanese do not derive substantial ancestry from an early dispersal of modern 
humans; instead, their modern human ancestry is consistent with coming from the same source as that of other non-Africans. 


To obtain a complete picture of human diversity, it is necessary to 
sequence the genomes of many individuals from diverse locations. To 
date, the largest whole-genome sequencing survey, the 1000 Genomes 
Project, analysed 26 populations of European, East Asian, South Asian, 
American, and sub-Saharan African ancestry. However, this and 
most other sequencing studies have focused on demographically large 
populations. Such studies tend to ignore smaller populations that are 
also important for understanding human diversity. In addition, many 
of these studies have sequenced genomes to only 4-6-fold coverage. 
Here, we report the Simons Genome Diversity Project (SGDP): deep 
genome sequences of 300 individuals from 142 populations chosen 
to span much of human genetic, linguistic, and cultural variation 
(Supplementary Data Table 1). 


Data set and catalogue of novel variants 
We sequenced the samples to an average coverage of 43-fold (range 
34-83-fold) at Illumina Ltd; almost all samples (278) were prepared 
using the same PCR-free library preparation 
(https://support.illumina.com/content/dam/illumina-marketing/documents/services/FastTrackServices_Methods_Tech_Note.pdf). 
We aligned reads to 
the human reference genome hs37d5/hg19 using BWA-MEM (BWA-0.7.12) 
(Supplementary Information section 1). We genotyped each 
sample separately using the Genome Analysis Toolkit (GATK), with 
a modification to eliminate bias towards genotypes matching the 
reference (Supplementary Information section 1). We developed a 
filtering procedure that generates a sample-specific mask. At ‘filter 
level 1’, which we recommend for most analyses, we retain an average 
of 2.13 Gb of sequence per sample and identify 34.4 million single 
nucleotide polymorphisms (SNPs) and 2.1 million insertion/deletion 
polymorphisms (indels) (Supplementary Information section 2). 
We have made the GATK-processed data available in a file small 
enough to download by FTP, along with software to analyse these data 
(Supplementary Information section 3). The SGDP data set highlights 
the incompleteness of current catalogues of human variation, with 
the fraction of heterozygous positions not discovered by the 1000 
Genomes Project being 11% in the KhoeSan and 5% in New Guineans 
and Australians (Extended Data Fig. 1; Supplementary Data Table 1). 
We used FermiKit to map short reads against each other, store the 
assemblies in a compressed form that retains all the information 
required for polymorphism discovery and analysis, and identify SNPs 
by comparing against the human reference. We find that FermiKit has 
comparable sensitivity and specificity to GATK for SNP discovery 
and genotyping, and is more accurate for indels (Supplementary 
Information section 4). FermiKit also identified 5.8 Mb of contigs that 
are present in the SGDP but absent in the human reference genome, 
presumably because they are deleted there; these contigs, which we 
have made publicly available, can be used as ‘decoys’ to improve read 
mapping (Supplementary Information section 5). Finally, we called 
copy number variants and used lobSTR to genotype 1.6 million short 
tandem repeats (STRs) (Supplementary Information section 6). The 
high quality of the STR genotypes (r² = 0.92 to capillary sequencing 
calls) is evident from their accurate reconstruction of population 
relationships, even for difficult-to-genotype mononucleotide repeats 
(Extended Data Fig. 2). 


The structure of human genetic diversity 
To obtain an overview of population relationships, we carried out 
ADMIXTURE (Extended Data Fig. 3) and principal component 
analysis (Extended Data Fig. 4a). We also built neighbour-joining 
trees based on pairwise divergence per nucleotide (Fig. 1a) and FST 
(Extended Data Fig. 4b) whose topologies are consistent with previous 
findings that the deepest splits among human populations are among 
Africans. We computed heterozygosity (the proportion of diallelic 
genotypes per base pair) and recapitulate previous findings that the 
highest genetic diversity is found in sub-Saharan Africa and that there is 
a much lower ratio of X-to-autosome diversity in non-Africans than in 
Africans (Fig. 1b). A surprise is that African ‘pygmy’ hunter-gatherers 
have reduced X-to-autosome diversity ratios relative to all other sub- 
Saharan Africans. This pattern is just as strong even after we remove 
the third of chromosome X known to be subject to the strongest natural 
selection, suggesting that the finding is driven by demographic history 
rather than by natural selection (Supplementary Information section 7). 
It has been suggested that the reduced X-to-autosome heterozygosity 
ratio in non-Africans is due to ongoing male-driven admixture. 
Male non-pygmy admixture into pygmies is well documented, so 
this process could explain these findings. 
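
As a minimal illustration of these two summary statistics, the following Python sketch computes heterozygosity as defined above and the X-to-autosome ratio. The function names and toy numbers are assumptions made for illustration only; in particular, the sketch assumes that chromosome X heterozygosity is measured in samples carrying two X chromosomes.

```python
def heterozygosity(genotypes, callable_bases):
    """Proportion of heterozygous genotypes per callable base pair.

    genotypes: iterable of (allele1, allele2) tuples at variable sites.
    callable_bases: number of base pairs passing filters, including
    invariant sites that are not listed in `genotypes`.
    """
    if callable_bases <= 0:
        raise ValueError("callable_bases must be positive")
    het_sites = sum(1 for a, b in genotypes if a != b)
    return het_sites / callable_bases


def x_to_autosome_ratio(x_het, autosome_het):
    """Ratio of chromosome X heterozygosity to autosomal heterozygosity."""
    if autosome_het <= 0:
        raise ValueError("autosome_het must be positive")
    return x_het / autosome_het
```

Under neutrality with equal numbers of breeding males and females, the expected ratio is 3/4, because a population carries three X chromosomes for every four copies of each autosome; ratios below this value are the pattern discussed above.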

Comparisons of ancient to present-day human genomes have shown 
that all non-Africans today possess Neanderthal ancestry, with 
more in eastern non-Africans, and that Australo-Melanesians, 
and to a lesser extent other eastern non-Africans, possess Denisovan 
ancestry. However, these studies only analysed genomes from 
a handful of populations. We computed statistics informative about 
Neanderthal and Denisovan ancestry and provide a fine-scale view 
of these ancestry distributions worldwide (Fig. 1c, d; Supplementary 
Data Table 1; Supplementary Information section 8). We do not detect 
any population with a higher proportion of Neanderthal ancestry than 
is present in East Asians. However, we do find suggestive evidence of 
an excess of Denisovan ancestry in some South Asians compared to 
other Eurasians. This signal may not have been detected before because 
earlier surveys of archaic introgression largely excluded South Asians 
(Fig. 1d; Supplementary Data Table 1). 


The time course of human population separation 
We studied demographic history by leveraging the fact that variation 
across the genome in divergent sites per base pair can be used to 
reconstruct population size changes and separations. We used the 
pairwise sequentially Markovian coalescent (PSMC) to reconstruct population 
size changes, and the multiple sequentially Markovian coalescent 
(MSMC) to study the time course of population separations. We infer 
that the population ancestral to all present day humans began to develop 


Figure 1 | Genetic variation in the SGDP. a, Neighbour-joining 
tree of relationships based on pairwise divergence. b, Plot of 
autosomal heterozygosity against the X-to-autosome heterozygosity 
ratio, showing the reduction in this ratio in non-Africans 
and pygmies. c, Estimate of Neanderthal ancestry with a heat 
map scale of 0-3%. d, Estimate of Denisovan ancestry with a heat 
map scale of 0-0.5% to bring out subtle differences in mainland 
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Eurasia (Oceanian groups with as 
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are saturated in bright red). 
Figure 2 | Cross-coalescence rates and effective population sizes for 
selected population pairs. a-c, Cross-coalescence rates as a function of 
time in thousands of years ago (kya) estimated using MSMC, with four 
haplotypes per pair. In each subfigure legend, we give the point estimate of 
the date at which 25%, 50% and 75% of lineages in the pair of populations 
have coalesced into a common ancestral population. We generated these 
plots using data phased with the 1000 Genomes reference panel (method 


substructure at least 200 thousand years ago (kya), which is most 
apparent when comparing the ancestors of some present-day African 
hunter-gatherers (southern African KhoeSan and central African 
Mbuti pygmies) to other populations (Fig. 2a). However, it is also clear 
that this substructure developed slowly, as all pairs of present-day 
populations including African hunter-gatherers share a substantial subset 
of their ancestors as recently as a hundred thousand years ago. 
Quoting the time at which MSMC infers that more than 50% (25-75%) 
of lineages for a pair of populations are descended from the same ances- 
tral population, we estimate that non-Africans separated substantially 
from KhoeSan 131 (82-173) kya and almost as anciently from the 
Mbuti around 112 (66-171) kya. Within Africa (Fig. 2a, b), we infer that 


PS1 described in Supplementary Information section 9), but only show 
pairs of populations for which the cross-coalescence rates are relatively 
insensitive to the phasing approach. a, Selected African cross-coalescence 
rates. b, Central African rainforest hunter-gatherer cross-coalescence 
rates. c, Ancient non-African cross-coalescence rates. d-f, Effective 
population sizes inferred using PSMC, using one diploid genome per 
population, for the same populations that we used in 


the Yoruba separated substantially from the KhoeSan 87 (58-120) kya; 
from the Mbuti 56 (32-84) kya; and from the Dinka 19 (9-25) kya. 
We estimate a relatively rapid 21 (21-26) kya separation of northern 
and southern KhoeSan, potentially reflecting isolation since the 
last glacial maximum; and a 38 (27-44) kya separation between western 
(Biaka) and eastern (Mbuti) pygmies, confirming very old substructure 
between these two central African hunter-gatherer groups. 
Outside Africa, the most ancient structure dates to around 50 kya 
(Fig. 2c) during or shortly after the deepest part of the shared non- 
African bottleneck 40-60 kya, consistent with the archaeological 
evidence of the dispersal of modern humans into Eurasia during 
this period. We are not confident about the estimates of the date of 
Figure 3 | Present-day populations have negligible ancestry from 
an early dispersal of modern humans out of Africa. Best-fitting 
admixture graph model of relationships among Australians, New 
Guineans, Andamanese and other diverse populations. Present-day 
populations are shown in blue, ancient samples in red, and select inferred 
ancestral nodes in green. Dotted lines indicate admixture events, all of 
which involve archaic humans. All f-statistic relationships are accurately 
fit to within 2.1 standard errors. Inset, results of adding putative early 
dispersal admixture to the graph model for different assumptions about 
when the early lineage split off. We specify the split time in terms of 
the genetic drift above the ‘Non-African’ node, with 0.01 units of drift 
representing on the order of ten thousand years. The (approximate) model 
likelihood is maximized with zero early dispersal ancestry, and no more 
than a few per cent is consistent with the data. 


separation of Australians, New Guineans and Andamanese from other 
populations because we find that these inferences change depending 
on the computational method we use for phasing, probably due to 
these populations not being represented in the 1000 Genomes haploid 
genome reference panel (Supplementary Information section 9). We 
caution that the date estimates also do not take into account uncertainty 
about the true value of the human mutation rate, which could plausibly 
be 30% higher or lower than the point estimate we use. 
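
The two steps described above, reading off the dates at which 25%, 50% and 75% of lineages have merged and then allowing for uncertainty in the mutation rate, can be sketched as follows. This is an illustration under stated assumptions rather than the MSMC procedure itself: the curve is a made-up example, linear interpolation between time points is an assumption of the sketch, and dates are assumed to scale inversely with the mutation rate.

```python
def crossing_time(times_kya, cross_coalescence, level):
    """Interpolate the time at which a cross-coalescence curve first reaches `level`.

    times_kya: increasing time points in thousands of years ago.
    cross_coalescence: relative rate at each time point
        (0 = fully separated, 1 = fully mixed).
    Returns None if the curve never reaches `level`.
    """
    if len(times_kya) != len(cross_coalescence):
        raise ValueError("times and rates must have the same length")
    if not times_kya:
        return None
    if cross_coalescence[0] >= level:
        return times_kya[0]
    for i in range(1, len(times_kya)):
        previous, current = cross_coalescence[i - 1], cross_coalescence[i]
        if previous < level <= current:
            fraction = (level - previous) / (current - previous)
            return times_kya[i - 1] + fraction * (times_kya[i] - times_kya[i - 1])
    return None


def rescale_date(date_kya, rate_factor):
    """Rescale a date if the true mutation rate is `rate_factor` times the assumed one."""
    if rate_factor <= 0:
        raise ValueError("rate_factor must be positive")
    return date_kya / rate_factor


# Hypothetical curve, not taken from Fig. 2.
toy_times = [0.0, 50.0, 100.0, 150.0, 200.0]
toy_rates = [0.0, 0.1, 0.4, 0.8, 1.0]
```

For the toy curve, the 25%, 50% and 75% crossing times are 75, 112.5 and 143.75 kya. Applying `rescale_date` to the 131 kya point estimate quoted above gives roughly 101 kya if the mutation rate were 30% higher and roughly 187 kya if it were 30% lower.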


Early modern human dispersals contributed little to non-Africans 

There is intense debate about whether present-day Australians, New 
Guineans and Asian ‘Negrito’ populations are descended from the 
same source population as mainland Eurasians, or whether they also 
derive some ancestry from an early, independent dispersal of modern 
humans into Asia. To explore this scenario rigorously, we fit 
an admixture graph (a phylogenetic tree incorporating mixture 
events) to the allele frequency correlations among Neanderthals, 
Denisovans, Upper Paleolithic Europeans, East Asians, New Guineans, 
Australians, and Andamanese. We obtain a good fit to the data if we 
include known Neanderthal and Denisovan introgression and model 
all modern human ancestry in New Guineans, Australians and 
Andamanese as part of an eastern clade together with mainland East 
Asians (Supplementary Information section 11; Fig. 3). Furthermore, 
when we manually introduce a deeply diverging modern human lineage 
contributing ancestry to Australians, New Guineans, and Andamanese 
(or when we repeat the analysis in a model without Andamanese), no 
position or proportion of the deep lineage improves the fit. If this 
putative source population branched off the main lineage leading to 
non-Africans more than about 10-20 thousand years before the separation 
of European and East Asian ancestors, we obtain an upper 
bound of a few per cent for the possible contribution to Australians 
and New Guineans (Fig. 3 inset; Supplementary Information section 
11). These results are at odds with an inference of substantial early 
dispersal ancestry in a previous analysis of an Australian genome; 
however, that study used a less complete model that, notably, did not 
include the known Denisovan admixture into Australo-Melanesians. 
The findings for Australians are also unlikely to be due to some unusual 
feature of the individuals we sequenced, as when we compared three 
different groups of Australian samples for which there is published 
genome-wide data, we found them all to be consistent with descending 
from a common homogeneous population since separation from New 
Guineans (Supplementary Information section 10). These results are 
not in conflict with skeletal and archaeological evidence of an early 
modern human presence outside of Africa, as early migrations 
could have occurred but not contributed substantially to present-day 
populations. The possibility of populations that once flourished but 
did not contribute substantially to living groups is especially plausible 
now that ancient DNA from the ~45 kya Ust’-Ishim and the ~40 kya 
Oase 1 individuals has documented their existence. 


Accelerated mutation accumulation in non-Africans 

The SGDP data provide an opportunity to compare the rates at which 
mutations have accumulated across populations. We restricted our 
analyses to samples for which our genotypes are likely to be most 
reliable (this included restricting to samples which were all processed 
in the same way), and we used the highest level of filtering (‘level 9’) 
(Supplementary Information section 7). We pooled samples by region 
to increase power, and for all pairs of regions, computed the expected 
number of positions where, if we picked a random chromosome from 
both, region A would mismatch chimpanzee and region B would be 
identical to chimpanzee (or vice versa). If the rate of accumulation of 
mutation has been the same since the two populations diverged, these 
numbers are expected to be equal. However, when we compute the 
ratio of mutations on one lineage or the other since separation, we 
find a subtle (average of 0.5%) but significant excess of mutations in 
non-Africans relative to sub-Saharan Africans (3.3 < |Z| < 9.4 standard 
errors from zero; Extended Data Table 1). Because any difference must 
reflect events since non-African/African population divergence, which 
is less than a tenth of average genetic divergence (Fig. 2a), this implies 
a greater difference in mutation accumulation rates since population 
divergence (~5%). We were concerned that these results might be 
biased by the fact that the human genome reference sequence is more 
closely related to non-Africans than to Africans, or by higher levels of 
heterozygosity in Africans, as both of these issues could make detection 
of divergent sites in Africans more difficult. However, we replicated the 
findings after remapping to chimpanzee, which is equally distant to all 
present populations, and after restricting analyses to the X chromosome 
in males (as males only have a single X chromosome, this procedure 
avoids bias due to different error rates in detecting heterozygous 
genotypes in populations with different rates of heterozygosity) 
(Extended Data Fig. 5). These observations are most likely to be 
explained by acceleration in the rate of mutation accumulation in 
non-Africans, since the same signal appears in comparisons to sub- 
Saharan Africans related in different ways to non-Africans (Extended 
Data Table 1). It is known that the rate of CCT > CTT mutations differs 
across human populations. However, this particular mutation class was 
found to be enriched relative to Africans in Europeans but not in East 
Asians, and thus cannot explain our signal. One of several possible 
explanations for these findings is a decrease in the generation interval 
in non-Africans compared to Africans since separation. 
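
The step from a 0.5% excess to a rate difference of about 5% follows from simple arithmetic, sketched below under the assumption that the two lineages accumulate mutations at the same rate before the population split and at rates differing by a constant factor afterwards. The function names are hypothetical.

```python
def expected_excess(rate_difference, fraction_since_split):
    """Relative excess of mutations on the faster lineage over the whole lineage.

    If a fraction f of the lineage post-dates the split and the rate is
    (1 + d) times higher only during that part, the lineages accumulate
    mutations in proportion 1 + d * f versus 1, so the excess is d * f.
    """
    return rate_difference * fraction_since_split


def implied_rate_difference(observed_excess, fraction_since_split):
    """Invert expected_excess: rate difference needed to explain an observed excess."""
    if not 0 < fraction_since_split <= 1:
        raise ValueError("fraction_since_split must be in (0, 1]")
    return observed_excess / fraction_since_split
```

With an observed excess of 0.005 and a post-split fraction of 0.1, the implied rate difference is 0.05; a smaller post-split fraction would imply a correspondingly larger difference.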


No species-wide sweeps in modern humans 


Finally, we used the SGDP data set to address the hypothesis that 
the widespread appearance of modern human behaviour in the 
archaeological record after ~50 kya was driven by one or a few changes 
in neurological genes that swept through the population shortly 
before this time. We first applied the 3P-CLR method to search 
for locations in the genome with low allele frequency differentiation 
between KhoeSan and other modern humans, combined with high dif- 
ferentiation between modern and archaic (Neanderthal and Denisovan) 
humans, as might be expected from a selective sweep in the ancestors of 
all modern humans (Supplementary Information section 12) (Extended 
Data Fig. 6). We found no strong outlier signals, although a caveat is 
that the scan has limited power and we could not apply it to filtered sections 
of the genome. We also applied the PSMC method to estimate 
the average time since the most recent common ancestor (TMRCA) 
of individuals’ two chromosomes in the genomic regions within the 
largest 3P-CLR peaks (38 peaks corresponding to the top 0.1%). In none 
of the regions did we infer that the great majority of all pairs of modern 
humans share a common ancestor <100 kya, as would be expected for 
a sweep just before ~50 kya (Supplementary Data Table 2). 

As a second approach to scanning for species-wide selective sweeps, 
we applied the PSMC to infer TMRCA for SGDP samples across the 
entire genome. This analysis found no regions where the great majority 
of pairs of human genomes are inferred to share a common ancestor 
<100 kya (the largest fraction seen anywhere in the genome is 68%; 
Extended Data Fig. 7). 
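
The logic of this scan can be sketched in a few lines of Python. The sketch assumes that a TMRCA estimate (in kya) is already available for each sample pair in each genomic region; the function names, the 95% threshold standing in for "the great majority", and the toy values are all hypothetical.

```python
def fraction_recent(tmrca_kya, cutoff_kya=100.0):
    """Fraction of sample pairs whose TMRCA in one region is below the cutoff."""
    if not tmrca_kya:
        raise ValueError("need at least one TMRCA estimate")
    return sum(1 for t in tmrca_kya if t < cutoff_kya) / len(tmrca_kya)


def sweep_candidates(tmrca_by_region, cutoff_kya=100.0, min_fraction=0.95):
    """Return regions where at least `min_fraction` of pairs coalesce before the cutoff.

    tmrca_by_region maps a region label to a list of per-pair TMRCAs.
    The 0.95 default is an illustrative threshold, not a value from the text.
    """
    return [
        region
        for region, values in tmrca_by_region.items()
        if fraction_recent(values, cutoff_kya) >= min_fraction
    ]


# Hypothetical per-pair TMRCAs (kya) for two regions.
toy_regions = {
    "region_1": [80.0, 90.0, 150.0, 400.0],
    "region_2": [50.0, 60.0, 70.0, 80.0],
}
```

In this toy input only `region_2` would be flagged; in the analysis reported above no region reaches such a threshold, the maximum observed fraction being 68%.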

Taken together, these results do not rule out the possibility that 
genetic changes contributed in a meaningful way to changes in human 
behaviour after 50 kya; for example, changing selection can produce 
shifts in the frequencies of pre-existing mutations to bring a population 
to a new and advantageous set-point for a phenotype, as occurred in the 
case of height differences between northern and southern Europeans. 
For polygenic selection, however, genetics is not a creative force, and 
instead responds to selection pressures imposed by novel environmen- 
tal conditions or lifestyles. Thus, our results provide evidence against a 
model in which one or a few mutations were responsible for the rapid 
developments in human behaviour in the last 50,000 years. Instead, 
changes in lifestyles due to cultural innovation or exposure to new 
environments are likely to have been driving forces behind the rapid 
transformations in human behaviour in the last 50,000 years. 
Extended Data Figure 1 | Heat map of fraction of heterozygous sites missed in the 1000 Genomes project. For each sample, we examine all 
heterozygous sites passing filter level 1, and compute the fraction included as known polymorphisms in the 1000 Genomes project. 
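
A minimal sketch of this per-sample calculation is given below. It assumes that sites are represented as (chromosome, position) tuples and that the catalogue of known polymorphisms is available as a set; both the representation and the function name are illustrative assumptions.

```python
def fraction_missed(het_sites, known_sites):
    """Fraction of a sample's heterozygous sites absent from a catalogue.

    het_sites: iterable of (chromosome, position) tuples passing filters.
    known_sites: iterable of (chromosome, position) tuples in the catalogue.
    """
    het = set(het_sites)
    if not het:
        return 0.0
    return len(het - set(known_sites)) / len(het)


# Hypothetical sites for one sample and a toy catalogue.
toy_het = [("1", 100), ("1", 200), ("2", 50), ("X", 10)]
toy_known = {("1", 100), ("2", 50), ("2", 60)}
```

Here two of the four heterozygous sites are absent from the toy catalogue, giving a missed fraction of 0.5; the fraction included as known polymorphisms is one minus this value.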
Extended Data Figure 2 | Worldwide variation in human short tandem 
repeats. a, Mean STR length is reported as the average of the length 
difference (in base pairs) from the GRCh37 reference for each genotype. 
Bubble area scales with the number of calls compared at each point. 

b, c, The first two principal components after performing principal 
component analysis on tetranucleotide and homopolymer genotypes, 
respectively. Colours represent the region of origin of each sample. 
d, Pairwise FST values between populations computed using only SNPs 
versus using combined SNP + STR loci. e, Block jackknife standard errors 
for the SNP versus SNP + STR FST analysis. The red dashed lines give 
the best-fit line, described by the formula in red. The black dashed line 
denotes the diagonal. 
Extended Data Figure 3 | ADMIXTURE analysis. We carried out 
unsupervised ADMIXTURE 1.23 analysis over the 300 SGDP 
individuals in 20 replicates with randomly chosen initial seeds, varying 
the number of ancestral populations between K=2 and K=12 and using 
default fivefold cross-validation (--cv flag). We used genotypes of at least 
filter level 1, and restricted analysis to sites where at least two individuals 
carried the variant allele (as singleton variants are non-informative for 
population clustering). After further filtering of sites with at least 99% 
completeness and performing linkage-disequilibrium-based pruning 
in PLINK 1.9 with parameters (--indep-pairwise 1000 100 0.2), a 
total of 482,515 single nucleotide polymorphisms remained. This figure 
shows the highest likelihood replicate for each value of K. We found 
that log likelihood monotonically increases with K, while the value 
K=5 minimizes cross-validation error (not shown). The solution at 
K=5 corresponds to major continental groups (Sub-Saharan Africans, 
Oceanians, East Asians, Native Americans, and West Eurasians), but 
we show the full range of K here as it illustrates finer-scale population 
structure that may be useful to users of the data. 
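
The two per-site filters described in this legend (at least two carriers of the variant allele, and at least 99% completeness) can be sketched as follows. The genotype encoding and the function name are assumptions made for illustration; linkage-disequilibrium pruning is not reproduced here.

```python
def keep_site(genotypes, min_carriers=2, min_completeness=0.99):
    """Decide whether a site passes the carrier and completeness filters.

    genotypes: one entry per individual, giving the number of copies of
    the variant allele (0, 1 or 2), or None for a missing genotype.
    """
    if not genotypes:
        return False
    called = [g for g in genotypes if g is not None]
    completeness = len(called) / len(genotypes)
    carriers = sum(1 for g in called if g > 0)
    return completeness >= min_completeness and carriers >= min_carriers


# Hypothetical sites typed in 100 individuals.
two_carriers = [1, 1] + [0] * 98                 # passes both filters
singleton = [2] + [0] * 99                       # only one carrier
incomplete = [1, 1, None, None] + [0] * 96       # 98% completeness
```

Of the three toy sites, only `two_carriers` is retained: `singleton` is carried by a single individual and `incomplete` falls below the completeness threshold.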
Extended Data Figure 4 | Principal component analysis and neighbour-joining tree. a, Principal component analysis. b, Neighbour-joining tree based 
on FST values for all populations with at least two samples. 
Extended Data Figure 5 | Fewer accumulated mutations in Africans 
than in non-Africans confirmed by mapping to chimpanzee. 
We compute a statistic D(Population A, Population B; Chimp) = (ABA - BAA)/(ABA + BAA), measuring 
the difference in the rate of matching to chimpanzee in Population A 
compared to Population B. The panels plot D(Afr, nonAfr; Chimp) × 100 at 
transversions and at transitions. The evidence of mismatching to chimpanzee is 
seen when we restrict to the male X chromosome to eliminate possible 
effects due to differences in heterozygosity across populations, and map to 
the chimpanzee genome which is phylogenetically symmetrically related 
to all present-day humans. We find that in 78 randomly chosen Population 
A = African and Population B = non-African pairs, transversion 
substitutions show no consistent skew from zero, but transition 
substitutions do. 
Extended Data Figure 6 | 3P-CLR scan for positive selection. The red line denotes the 99.9% quantile cut-off. The genes in the top five regions are 
labelled. a, Scan for selection on the San terminal branch. b, Scan for selection on the non-San terminal branch. c, Scan for selection on the ancestral 
modern human branch. 
Extended Data Figure 7 | Scan for genomic locations where the great 
majority of present-day humans share a recent common ancestor. 

We carried out PSMC analysis on 40 pairs of haploid genomes chosen to 
sample some of the most deeply divergent present-day human lineages. 
We recorded the time since the most recent common ancestor (TMRCA) 
at each position, and rescaled to obtain an estimate of absolute time 
(Supplementary Information section 12). a, Distribution across the 
genome of the fraction of TMRCAs below specified date cut-offs. For the 
100 kya cut-off, the maximum fraction observed anywhere in the genome 
is 68%. b, Distribution across the genome of the date T at which specified 
fractions of sample pairs are inferred to have a TMRCA less than T. 
c, Percentile points of the cumulative distribution function of B. 
Extended Data Table 1 | Fewer accumulated mutations in Africans than in non-Africans 


                              All autosomes    X chromosome    Lowest B quintile    Highest B quintile 
Population A   Population B   D×100   Z        D×100   Z       D×100 
Africa         Oceania 
We compute a statistic D (Population A, Population B, Chimp), measuring the difference in the rate of matching to chimpanzee in Population A compared to Population B. For all the autosomes, we 
observe highly significant signals (3.3 < |Z| < 9.4) of excess mismatching to chimpanzee in non-Africans compared to Africans, using a standard error from a block jackknife. We highlight |D| > 0.002 
in blue, and |Z| > 3 in yellow. The deviations from zero are greatest in subsets of the genome where the time since two populations split comprises a relatively larger fraction of the total genetic 
divergence time between the populations; this is the direction expected from a mutation accumulation change since divergence. Compared to all the autosomes as a baseline, a least squares fit 
indicates that the deviations are 2.2 times higher on chromosome X, 2.0 times higher in the quintile of lowest B-statistic (closest to functionally important regions), and 0.43 times as high in the quintile 
of highest B-statistic (furthest from functional regions). 
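
The D statistic and its block jackknife standard error can be sketched as follows. The sketch treats ABA and BAA as opaque site-pattern counts, as in the expression D = (ABA - BAA)/(ABA + BAA) given with Extended Data Fig. 5, and uses a simple unweighted delete-one-block jackknife; the block weighting actually used is not specified here, so this is an assumption, as are the function names and toy counts.

```python
import math


def d_statistic(aba, baa):
    """D = (ABA - BAA) / (ABA + BAA) from two site-pattern counts."""
    if aba + baa == 0:
        raise ValueError("no informative sites")
    return (aba - baa) / (aba + baa)


def block_jackknife(blocks):
    """Estimate D, its standard error and Z from per-block counts.

    blocks: list of (aba, baa) counts, one tuple per genomic block.
    Uses an unweighted delete-one-block jackknife.
    """
    n = len(blocks)
    if n < 2:
        raise ValueError("need at least two blocks")
    total_aba = sum(a for a, _ in blocks)
    total_baa = sum(b for _, b in blocks)
    d_all = d_statistic(total_aba, total_baa)
    # Recompute D with each block left out in turn.
    leave_one_out = [d_statistic(total_aba - a, total_baa - b) for a, b in blocks]
    mean_loo = sum(leave_one_out) / n
    variance = (n - 1) / n * sum((d - mean_loo) ** 2 for d in leave_one_out)
    se = math.sqrt(variance)
    z = d_all / se if se > 0 else float("inf")
    return d_all, se, z


# Hypothetical counts for four genomic blocks.
toy_blocks = [(105, 95), (110, 90), (100, 100), (108, 92)]
```

For the toy blocks the pooled counts are 423 and 377, so D = 46/800 = 0.0575; the Z score is this value divided by the jackknife standard error.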
A genomic history of Aboriginal Australia 


Anna-Sapfo Malaspinas, Michael C. Westaway, Craig Muller, Vitor C. Sousa, Oscar Lao, Isabel Alves, 
Anders Bergström, Georgios Athanasiadis, Jade Y. Cheng, Jacob E. Crawford, Tim H. Heupink, Enrico Macholdt, 
The population history of Aboriginal Australians remains largely uncharacterized. Here we generate high-coverage 
genomes for 83 Aboriginal Australians (speakers of Pama-Nyungan languages) and 25 Papuans from the New Guinea 
Highlands. We find that Papuan and Aboriginal Australian ancestors diversified 25-40 thousand years ago (kya), suggesting 
pre-Holocene population structure in the ancient continent of Sahul (Australia, New Guinea and Tasmania). However, 
all of the studied Aboriginal Australians descend from a single founding population that differentiated ~10-32 kya. 
We infer a population expansion in northeast Australia during the Holocene epoch (past 10,000 years) associated with 
limited gene flow from this region to the rest of Australia, consistent with the spread of the Pama-Nyungan languages. 
We estimate that Aboriginal Australians and Papuans diverged from Eurasians 51-72 kya, following a single out-of-Africa 
dispersal, and subsequently admixed with archaic populations. Finally, we report evidence of selection in Aboriginal 
Australians potentially associated with living in the desert. 


During most of the last 100,000 years, Australia, Tasmania and New 
Guinea formed a single continent, Sahul, which was separated from 
Sunda (the continental landmass including mainland and west- 
ern island Southeast Asia) by a series of deep oceanic troughs never 
exposed by changes in sea level. Colonization of Sahul is thought 
to have required at least 8-10 sea crossings between islands, poten- 
tially constraining the occupation of Australia and New Guinea by 


earlier hominins. Recent assessments suggest that Sahul was settled 
by 47-55 kya (Fig. 1). These dates align with those for the earliest 
evidence for modern humans in Sunda. 

The distinctiveness of the Australian archaeological and fossil record 
has led to the suggestion that the ancestors of Aboriginal Australians 
and Papuans (‘Australo-Papuans’ hereafter) left the African continent 
earlier than the ancestors of present-day Eurasians. Although some 
Figure 1 | Aboriginal Australian and Papuan samples used in this study, 
as well as archaeological sites and human remains dated to ~40 kya 
or older in southern Sunda and Sahul. The stars indicate the centroid 
location for each sampling group (sample size in parentheses). Publicly 
available genetic data (see Supplementary Information section S04) used 
as a reference panel in this study are shown as squares. Sites with dated 
human remains are shown as white circles and the archaeological sites 
as black circles. The associated dates can be found in Supplementary 
Information section S03. Grey boundaries correspond to territories 
defined by the language groups provided by the Australian Institute 
of Aboriginal and Torres Strait Islander Studies. Sampled Aboriginal 
Australians self-identify primarily as: Yidindji and Gungandji from the 
Cairns region (CAI, n = 10, see also Supplementary Information section 
S02); Yupangati and Thanakwithi from northwest Cape York (WPA, 
n = 6); Wangkangurru and Yarluyandi from the Birdsville region (BDV, 


genetic studies support such multiple dispersals from Africa, others 
favour only one out-of-Africa (OoA) event, with one or two independent 
founding waves into Asia, of which the earlier contributed 
to Australo-Papuan ancestry. In addition, recent genomic studies 
have shown that both Aboriginal Australian and Papuan ancestors 
admixed with Neanderthal and Denisovan archaic hominins after 
leaving Africa. 

Increased desertification of Australia during the last glacial maximum 
(LGM) 19-26.5 kya affected the number and density of human 
populations. In this context, unique morphological and physiological 
adaptations have been identified in Aboriginal Australians living in 
desert areas today. In particular, desert groups were hypothesized to 
withstand sub-zero night temperatures without showing the increase 
in metabolic rates observed in Europeans under the same conditions. 

At the time of European contact, Aboriginal Australians spoke 
over 250 distinct languages, two-thirds of which belong to the Pama-Nyungan 
family and cover 90% of the Australian mainland. The 
place of origin of this language family and the effect of its extensive 
diffusion on its internal phylogenetic structure have been debated, 
but the pronounced similarity among Pama-Nyungan languages, 
together with shared socio-cultural patterns, has been interpreted 
as resulting from a mid-Holocene expansion. Other changes in 
the mid-late Holocene (~4 kya) include the proliferation of backed 
blades and the introduction of the dingo. It has been suggested that 
n = 10, 9 sequenced at high depth), Barkindji from southeast (RIV, n = 8); 
Pilbara area Yinhawangka and Banjima (PIL, n = 12), Ngaanyatjarra from 
western central desert (WCD, n = 13), Wongatha from Western Australia’s 
northern Goldfields (WON, n = 11), Ngadju from Western Australia’s 
southern Goldfields (NGA, n = 6); and Nyungar from southwest Australia 
(ENY, n = 8). Papuans include samples from the locations Bundi (BUN, 
n = 5), Kundiawa (KUN, n = 5), Mendi (MEN, n = 5), Marawaka (MAR, 
n = 5) and Tari (TAR, n = 5). We generated SNP array data (black stars) 
for 45 Papuan samples including 24 Koinambe (KOI) and 15 Kosipe 
(KOS), described previously, and 6 individuals with Highland ancestry 
sampled in Port Moresby (PMO). Lake Carpentaria (LC), which covered a 
significant portion of the land bridge between Australia and New Guinea 
11.5-40 kya and thus potentially acted as a barrier to gene flow, is also 
indicated. Map data were sourced from the Australian Government, 
http://www.naturalearthdata.com/ and our research. 


Pama-Nyungan languages, dingoes and backed blades all reflect the 
same recent migration into Australia. Although an external origin for 
backed blades has been rejected, dingoes were certainly introduced, 
most likely via island Southeast Asia. A recent genetic study found 
evidence of Indian gene flow into Australia at the approximate time 
of these Holocene changes, suggesting a possible association, while 
substantial admixture with Asians and Europeans is well documented 
in historical times. 

To date, only three Aboriginal Australian whole genome sequences 
have been described: one deriving from a historical tuft of hair from 
Australia’s Western Desert and two others from cell lines with limited 
provenance information. In this study, we report the first extensive 
investigation of Aboriginal Australian genomic diversity by analysing 
the high-coverage genomes of 83 Pama-Nyungan-speaking Aboriginal 
Australians and 25 Highland Papuans. 


Dataset 

We collected saliva samples for DNA sequencing in collaboration 
with Aboriginal Australian communities and individuals in Australia 
(Supplementary Information section S01). We sequenced genomes 
at high depth (average of 60×, range 20-100×) from 83 Aboriginal 
Australian individuals widely distributed geographically and linguistically 
(see Fig. 1 and associated legend for the location and label for each 
group as well as Extended Data Table 1, Supplementary Information 
Figure 2 | Genetic ancestry of Aboriginal Australians in a worldwide 
context. a, Estimation of genomic ancestry proportions for the best 
number of ancestral components (K = 7) based on Aboriginal Australian 
and Papuan whole-genome sequence and SNP array data from this study 
(see Fig. 1), and publicly available SNP array data (Supplementary 
Information section S05). Each ancestry component has been labelled 
according to the geographic region showing the corresponding highest 
frequency. The area of each pie chart is proportional to the sample size 
(as depicted in the legend). The genomes of Aboriginal Australian populations 
are mostly a mixture of European and Aboriginal Australian ancestry 
components. Northern Aboriginal Australian groups (Arnhem Land, CAI, 
ECCAC, PIL and WPA) are also assigned to components mainly present in 
East Asian populations, while northeastern Aboriginal Australian groups 
(CAI and WPA) also show components mainly present in New Guinean 
populations. A background of 5% ‘Melanesian’ component is observed 
in all the Aboriginal Australian populations; however, this component is 
widely spread over the geographic area shown in this figure, being present 
from Taiwan to India. We detected on average 1.5% ‘Indian’ component 
and 1.4% ‘Polynesian’ component across the Aboriginal Australian 
samples, but we attribute these residual ancestry components to statistical 
noise as they are present in other Southeast Asian populations and are not 
supported by other analyses (Supplementary Information section S05). 
b, Classical multidimensional scaling (MDS) plot of first two dimensions 


sections S02-S04 for more information). Additionally, we sequenced 
25 Highland Papuan genomes (38-53×; Supplementary Information 
sections S03, S04) from individuals representative of five linguistic 
groups, and generated genotype data for 45 additional Papuans living 
or originating in the Highlands (Fig. 1). These datasets were combined 
with previously published genomes and SNP array genotype data, 
including Aboriginal Australian data from Arnhem Land, and from a 
human diversity cell line panel from the European Collection of Cell 
Cultures (ECCAC, Fig. 1, Supplementary Information section S04). 
We explored the extent of admixture in the Aboriginal Australian 
autosomal gene pool by estimating ancestry proportions with an 
approach based on sparse non-negative matrix factorization (sNMF). 
We found that the genomic diversity of Aboriginal Australian popu- 
lations is best modelled as a mixture of four main genetic ancestries 
that can be assigned to four geographic regions based on their relative 
frequencies: Europe, East Asia, New Guinea and Australia (Fig. 2a, 
Extended Data Fig. 1, Supplementary Information section S05). The 
degree of admixture varies among groups (Supplementary Information 
section S05), with the Ngaanyatjarra speakers from central Australia 
(WCD) having a significantly higher ‘Aboriginal Australian component’ 
(median value = 0.95) in their genomes than the other groups 
sampled (median value = 0.64; Mann-Whitney rank sum test, 
one-tail P = 3.55 × 10⁻⁷). The East Asian and New Guinean components 
are mostly present in northeastern Aboriginal Australian populations, 
while the European component is widely distributed across groups 
(Fig. 2a, Extended Data Fig. 1, Supplementary Information section 
S05). In most of the subsequent analyses, we either selected specific 
samples or groups according to their level of Aboriginal Australian 
ancestry, or masked the data for the non-Aboriginal Australian ancestry 
genomic components (Supplementary Information section S06). 
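
The one-tailed Mann-Whitney comparison reported above can be illustrated with a minimal sketch. The ancestry values below are invented toy numbers, not the study's data, and the function name is hypothetical. The P value uses a plain normal approximation without tie or continuity correction, so it is only indicative for small samples.

```python
import math

def mann_whitney_one_tailed(x, y):
    """Return (U, p) for the alternative 'values in x tend to exceed values in y'.

    Mid-ranks are used for ties; p comes from a normal approximation
    (an assumption of this sketch, not necessarily the authors' procedure).
    """
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    rank_sum_x = 0.0
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j][0] == pooled[i][0]:
            j += 1
        mid_rank = (i + 1 + j) / 2.0  # mean of the 1-based ranks i+1 .. j
        rank_sum_x += mid_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    n1, n2 = len(x), len(y)
    u = rank_sum_x - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (u - mean_u) / sd_u
    p = 0.5 * math.erfc(z / math.sqrt(2.0))  # upper-tail probability
    return u, p

# Toy 'Aboriginal Australian component' proportions (made up for illustration).
wcd = [0.97, 0.95, 0.93, 0.96, 0.94]
others = [0.60, 0.70, 0.64, 0.55, 0.80, 0.66]

u_stat, p_value = mann_whitney_one_tailed(wcd, others)
print(u_stat, p_value)
```

Because every toy WCD value exceeds every value in the comparison group, U reaches its maximum of n1 × n2.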


based on an identity-by-state (IBS) distance matrix (based on 54,971 
SNPs) between individuals from this study and worldwide populations, 
including publicly available data. The first two dimensions explain 
19% of the variance in the IBS distance matrix. Individuals are colour- 
coded according to sampling location, grouped into Australia (Arnhem 
Land, ECCAC, BDV, CAI, ENY, NGA, PIL, RIV, WCD, WON, WPA); East 
Asia (Cambodian, Dai, Han, Japanese, Naxi); Europe (English, French, 
Sardinian, Scottish, Spanish); India (Vishwabrahmin, Dravidian, Punjabi, 
Guaharati); and New Guinea (HGDP-Papuan, Central Province, Eastern 
Highlands, Gulf Province, Highlands, PMO, KOI, KOS, BUN, KUN, MEN, 
TAR, MAR). Stars indicate the centroid for each Aboriginal Australian 
group. Aboriginal Australians from this study as well as from previous 
studies are closest to Papuans and also show signals of admixture with 
Eurasians (see Supplementary Information section S05 for details). 
c, A heat map displaying outgroup f3 statistics of the form f3(Mbuti; 
WCD02, X), quantifying genetic drift shared between the putatively 
unadmixed individual WCD02 chosen to represent the Aboriginal 
Australian population, and various populations throughout the broader 
region for which either array genotypes or whole-genome sequencing data 
were publicly available or generated in this study. We used 760,116 SNPs 
for which WCD02 had non-missing array genotypes that overlapped with 
any other datasets. Standard errors, as estimated from block jack-knife 
resampling across the genome, were in the range 0.002-0.007. 


Colonization of Sahul 

The origin of Aboriginal Australians is a source of much debate, 
as is the nature of the relationships among Aboriginal Australians, 
and between Aboriginal Australians and Papuans. Using f3 
statistics, estimates of genomic ancestry proportions and classical 
multidimensional scaling (MDS) analyses, we find that Aboriginal 
Australians and Papuans are closer to each other than to any other 
present-day worldwide population considered in our study (Fig. 2b, c, 
Supplementary Information section S05). This is consistent with 
Aboriginal Australians and Papuans originating from a common 
ancestral population which initially colonized Sahul. Moreover, outgroup 
f3 statistics do not reveal any significant differences between 
Papuan populations (Highland Papuan groups sampled in this study 
and the Human Genome Diversity Project (HGDP-Papuans)) in 
their genetic affinities to Aboriginal Australians (Extended Data 
Fig. 2a), suggesting that Papuan populations diverged from one 
another after or at the same time as they diverged from Aboriginal 
Australians. 
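
As a rough illustration of what an outgroup f3 statistic measures, the sketch below averages (o - a)(o - b) over SNPs, where o, a and b are allele frequencies in the outgroup and in the two test populations. The function name and the frequencies are hypothetical. The sketch omits the finite-sample bias correction and the block jack-knife standard errors that a real analysis would need.

```python
def outgroup_f3(freq_out, freq_a, freq_b):
    """Mean of (o - a) * (o - b) across SNPs.

    Larger values indicate more genetic drift shared by A and B
    after their common split from the outgroup.
    """
    if not (len(freq_out) == len(freq_a) == len(freq_b)):
        raise ValueError("frequency lists must have equal length")
    if not freq_out:
        raise ValueError("at least one SNP is required")
    total = sum((o - a) * (o - b) for o, a, b in zip(freq_out, freq_a, freq_b))
    return total / len(freq_out)

# Toy allele frequencies at four SNPs (invented for illustration).
out = [0.1, 0.5, 0.9, 0.3]    # outgroup, e.g. the role played by Mbuti
pop_a = [0.6, 0.5, 0.2, 0.3]  # focal population
pop_b = [0.7, 0.4, 0.1, 0.3]  # drifted together with pop_a
pop_c = [0.1, 0.6, 0.9, 0.2]  # shares little drift with pop_a

f3_ab = outgroup_f3(out, pop_a, pop_b)
f3_ac = outgroup_f3(out, pop_a, pop_c)
print(f3_ab, f3_ac)
```

In this toy case the pair that drifted together gives the larger value, which is the pattern used to rank populations by affinity to a focal sample.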

To investigate the number of founding waves into Australia, we contrasted 
alternative models of settlement history through a composite 
likelihood method that compares the observed joint site frequency 
spectrum (SFS) to that predicted under specific demographic models 
(Fig. 3, Supplementary Information section S07). We compared 
HGDP-Papuans to four Aboriginal Australian population samples 
with low levels of European admixture (Extended Data Fig. 1) from 
both northeastern (CAI and WPA) and southwestern desert (WON 
and WCD) Australia. We compared one- and two-wave models, where 
each Australian region was either colonized independently, or by 
descendants of a single Australian founding population after its diver- 
gence from Papuans. The one-wave model provides a better fit to the 
Figure 3 | Settlement of Australia. Best supported demographic model of 
the colonization of Australia and New Guinea. The demographic history 
of Aboriginal Australian populations was modelled by considering that 
sampled individuals are from sub-populations (‘islands’) that are part of 
two larger regions (‘continents’), which geographically match the northeast 
and the southwestern desert regions of Australia. Maximum likelihood 
parameter estimates were obtained from the joint SFS of Han Chinese, 
HGDP-Papuans, CAI, WPA, WON and WCD. The 95% CI, obtained by 
non-parametric block bootstrap, are shown within parentheses. Estimated 
migration rates scaled by the effective population size (2Nm) are shown 
above/below the corresponding arrows. Only Aboriginal Australian 
individuals with low European ancestry were included in this analysis. In 
this model, we estimated parameters specific to the settlement of Australia 
and New Guinea (numerical values shown in black), keeping all the 
other demographic parameters set to the point estimates shown in Fig. 4 
(numerical values shown in grey here). Only admixture events involving 
proportions >0.5% are shown. The inferred parameters were scaled using 
a mutation rate of 1.25 × 10⁻⁸ per generation per site and a generation 
time of 29 years corresponding to the average hunter-gatherer generation 
interval for males and females. See Supplementary Information section 
S07 for further details. 


observed SFS, suggesting that the ancestors of the sampled Aboriginal 
Australians diverged from a single ancestral population. This conclusion 
is also supported by MDS analyses (Fig. 2b), as well as by 
estimation of ancestry proportions where all Aboriginal Australians 
form a cluster distinct from the Papuan populations (Extended Data 
Fig. 1, Supplementary Information section S05). Additionally, it is 
supported by outgroup f3 analyses, where all Aboriginal Australians 
are largely equidistant from Papuans when adjusting for recent admixture 
(Extended Data Fig. 2b). Thus, our results, based on 83 Pama-Nyungan 
speakers, do not support earlier claims of multiple ancestral 
migrations into Australia giving rise to contemporary Aboriginal 
Australian diversity. 

The SFS analysis indicates that there was a bottleneck in the ancestral 
Australo-Papuan population ~50 kya (95% confidence intervals (CI) 
35-54 kya, Supplementary Information section S07), which overlaps 
with archaeological evidence for the earliest occupation of both Sunda 
and Sahul 47-55 kya. We further infer that the ancestors of Pama-Nyungan 
speakers and Highland Papuans diverged ~37 kya (95% CI 
25-40 kya, Fig. 3, Supplementary Information section S07), which is 
in close agreement with results of multiple sequentially Markovian 
coalescent (MSMC) analyses (Extended Data Fig. 2c, Supplementary 
Information section S08), a method estimating cross coalescence rates 
between pairs of populations based on individuals’ haplotypes. This 
result is also in agreement with previous estimates, for example, based 
on SNP array data. 


Archaic admixture 

We characterized the number, timing and intensity of archaic gene- 
flow events using three complementary approaches: SFS-based 
(Supplementary Information section S07), a goodness-of-fit analysis 
combining D-statistics (Supplementary Information section 
S09), and a method that infers putatively derived archaic ‘haplotypes’ 
(Supplementary Information section S10). Aboriginal Australian 
and Papuan genomes show an excess of putative Denisovan 
introgressed sites (Extended Data Fig. 3a, Supplementary Information 
section S11), as well as substantially more putative Denisovan-derived 
haplotypes (PDHs) than other non-Africans (Extended Data 
Fig. 3b, Supplementary Information section S10). The number and total 
length of those putative haplotypes vary considerably across samples. 
However, the estimated number of PDHs correlates almost perfectly 
(r² = 0.96) with the estimated proportion of Australo-Papuan ancestry 
in each individual (Extended Data Fig. 3c). We found no significant 
difference in the distribution of the number of PDHs or the average 
length of PDHs between putatively unadmixed Aboriginal Australians 
and Papuans (Mann-Whitney U-test, P > 0.05). Moreover, the genetic 
differentiation between WCD and Papuans was also similar for both 
autosomal SNPs and PDHs, with FST values around 0.12. Taken together, 
these analyses provide evidence for Denisovan admixture predating the 
population split between Aboriginal Australians and Papuans (see also 
refs 26, 53) and widespread recent Eurasian admixture in Aboriginal 
Australians (Fig. 2a, b, Supplementary Information section S05). By constraining 
Denisovan admixture as having occurred before the Aboriginal 
Australian-Papuan divergence, the SFS-based approach results in an 
admixture estimate of ~4.0% (95% CI 3.3-5.0%, Fig. 4, Supplementary 
Information section S07), similar to that obtained by D-statistics 
(~5%, Supplementary Information section S09). The SFS analyses 
further suggest that Denisovan/Australo-Papuan admixture took place 
~44 kya (95% CI 31-50 kya, Supplementary Information section S07), 
a date that overlaps with an estimate from a more recent study. 

The SFS analysis also provides evidence for a primary Neanderthal 
admixture event (~2.3%, 95% CI 1.1-3.5%) taking place in the ancestral 
population of all non-Africans ~60 kya (95% CI 55-84 kya, 
Fig. 4, Supplementary Information section S07). Although we cannot 
estimate absolute dates of archaic admixture from the lengths of PDHs 
and putative Neanderthal-derived haplotypes (PNHs) in our samples, 
we can obtain a relative date. We found that, for putatively unadmixed 
Aboriginal Australians and HGDP-Papuans, the average PNH and 
PDH lengths are 33.8 kb and 37.4 kb, respectively (Extended Data 
Fig. 3b). These are significantly different from each other (P=9.65x10-° 
using a conservative sign test), and suggest that the time since 
Neanderthal admixture was about 11% greater than the time since 
Denisovan admixture, roughly in line with our SFS-based estimates 
for the Denisovan pulse (31-50 kya, Fig. 4) versus the primary pulse 
of Neanderthal admixture (55-84 kya). The SFS analysis also indicates 
that the main Neanderthal pulse was followed by a further 1.1% (95% 
CI 0.2-2.7%, Fig. 4, Supplementary Information section S07) pulse of 
Neanderthal gene flow into the ancestors of Eurasians. Finally, using 
our SFS- and haplotype-based approaches, we explored additional 
models involving complex structure among the archaic populations. 
We found suggestive evidence that the archaic contribution could 
be more complex than the model involving the discrete Denisovan 
and Neanderthal admixture pulses shown in Fig. 4 (Supplementary 
Information sections S07, S10). 
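
The relative dating from haplotype lengths can be reproduced with a short calculation. The sketch assumes, as a simplification that may not match the authors' exact procedure, that each archaic contribution arrived in a single pulse and that the mean length of introgressed haplotypes is inversely proportional to the time since admixture. The function name is hypothetical; the two lengths are the values quoted in the text.

```python
def relative_admixture_time(mean_len_a_kb, mean_len_b_kb):
    """Time since pulse A divided by time since pulse B.

    Assumes mean tract length is proportional to 1 / time, so the
    ratio of times is the inverse ratio of mean lengths.
    """
    if mean_len_a_kb <= 0 or mean_len_b_kb <= 0:
        raise ValueError("mean tract lengths must be positive")
    return mean_len_b_kb / mean_len_a_kb

PNH_KB = 33.8  # mean putative Neanderthal-derived haplotype length (from the text)
PDH_KB = 37.4  # mean putative Denisovan-derived haplotype length (from the text)

ratio = relative_admixture_time(PNH_KB, PDH_KB)
print(f"Time since Neanderthal admixture is about {100 * (ratio - 1):.0f}% greater")
```

Shorter Neanderthal-derived haplotypes imply more generations of recombination, hence an older pulse; the ratio 37.4 / 33.8 corresponds to the roughly 11% figure given above.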


Out of Africa 

To investigate the relationship of Australo-Papuan ancestors with 
other world populations, we computed D-statistics of the form 
((H1 = Aboriginal Australian, H2 = Eurasian), H3 = African) and 
((H1 = Aboriginal Australian, H2 = Eurasian), H3 = Ust’-Ishim). 
Several of these were significantly positive (Supplementary Information 
section S09), suggesting that Africans and Ust’-Ishim (the 45,000-year-old 
remains of a modern human from Asia) are both closer to 





Figure 4 | Out of Africa. We used a likelihood-based approach to 
investigate whether the joint SFS supports the one-wave (1 OoA) or two- 
wave (2 OoA) scenarios. The maximum likelihood estimates are indicative 
of which scenario is best supported. As shown on the top left inset, under 
the 1 OoA scenario we expect (i) the presence of an ancestral bottleneck 
(in black); (ii) a relatively large Neanderthal admixture pulse shared by the 
ancestors of all non-Africans; and (iii) overlapping divergence times 
of the ancestors of Aboriginal Australians and Eurasians. In contrast, the 
top right inset shows parameters expected under a 2 OoA scenario: 
(i) a limited/absent ancestral bottleneck (in black) in the ancestors of all 
non-Africans; (ii) no shared Neanderthal admixture in the ancestors of 
all non-Africans; (iii) distinct divergence times for Aboriginal Australians 
and Eurasians. The main population tree shows the best fitting topology, 
which supports the 1 OoA scenario, and maximum likelihood estimates 
(MLEs) for the divergence and admixture times and the admixture 
proportions (with 95% CI obtained by non-parametric block bootstrap 
shown within parentheses). We assume that the OoA event is associated 
with the ancestral bottleneck. The ‘Ghost’ population represents an 
unsampled population related to Yoruba that is the source of the out-of- 
Africa event(s). Our results suggest that these two African populations 
split significantly earlier (~125 kya) than the estimated time of dispersals 
into Eurasia. Note that under a 1 OoA scenario, this ghost population 
becomes, after the ancestral bottleneck, the ancestral population of all 
non-Africans that admixed with Neanderthals. Arrow thicknesses are 
proportional to the intensity of gene flow and the admixture proportions, 
and only admixture events involving proportions >0.5% are displayed. 
The inferred parameters were scaled as for Fig. 3. See Supplementary 
Information section S07 for further details. 


Eurasians than to Aboriginal Australians. These findings are in 
agreement with a model of Eurasians and Australo-Papuan ancestors 
dispersing from Africa in two independent waves. However, when correcting 
for a moderate amount of Denisovan admixture, Aboriginal 
Australians and Eurasians become equally close to Ust’-Ishim, as 
expected in a single OoA scenario (Supplementary Information section 
S09). Similarly, the D-statistics for ((H1 = Aboriginal Australian, 
H2 = Eurasian), H3 = African) became much smaller after correcting 
for Denisovan admixture. Additionally, a goodness-of-fit approach 
combining D-statistics across worldwide populations indicates stronger 
support for two waves OoA, but when taking Denisovan admixture into 
account, a one-wave scenario fits the observed D-statistics equally well 
(Extended Data Fig. 4a, b, Supplementary Information section S09). 
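
To make the logic of a D-statistic of the form ((H1, H2), H3) concrete, the sketch below counts ABBA and BABA site patterns in four aligned haploid sequences and returns (ABBA - BABA) / (ABBA + BABA). The function name and sequences are hypothetical. Sign conventions differ between software packages; this sketch assumes that a positive value means H3 shares more derived alleles with H2 than with H1, and it does not compute the jack-knife standard errors needed to assess significance.

```python
def d_statistic(h1, h2, h3, outgroup):
    """ABBA-BABA D for four aligned sequences of biallelic sites.

    The outgroup allele is treated as ancestral. Only sites where H3
    carries the derived allele and H1 and H2 differ are informative.
    """
    abba = baba = 0
    for a, b, c, o in zip(h1, h2, h3, outgroup):
        if c == o:
            continue  # H3 is ancestral here: uninformative site
        if a == o and b == c:
            abba += 1  # H2 shares the derived allele with H3
        elif b == o and a == c:
            baba += 1  # H1 shares the derived allele with H3
    if abba + baba == 0:
        raise ValueError("no informative sites")
    return (abba - baba) / (abba + baba)

# Toy alignment: 'A' = ancestral allele, 'B' = derived allele (invented data).
seq_out = "AAAAAA"
seq_h3 = "BBBBBB"
seq_h1 = "ABAABA"
seq_h2 = "BBABAB"

d_value = d_statistic(seq_h1, seq_h2, seq_h3, seq_out)
print(d_value)
```

With three ABBA sites and one BABA site the toy value is positive, the same direction as the uncorrected statistics described above, where H3 appears closer to H2.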

To investigate the timing and number of OoA events giving rise 
to present-day Australo-Papuans and Eurasians further, we used the 
observed SFS in a model-based composite likelihood framework. 
When considering only modern human genomes, we find evidence 
for two waves OoA, with a dispersal of Australo-Papuans ~14,000 
years before Eurasians (Supplementary Information section S07). 
However, when explicitly taking into account Neanderthal and 
Denisovan introgression into modern humans, the SFS analysis supports 
a single origin for the OoA populations marked by a bottleneck 
~72 kya (95% CI 60-104 kya, Fig. 4, Supplementary Information 
section S07). This scenario is reinforced by the observation that the 
ancestors of Australo-Papuans and Eurasians share a 2.3% (95% CI 
1.1-3.5%) Neanderthal admixture pulse. Furthermore, modern 
humans have both a linkage disequilibrium decay rate and a number 
of predicted deleterious homozygous mutations (recessive genetic load) 
that correlate with distance from Africa (Supplementary Information 
sections S05, S11, Extended Data Fig. 5), again consistent with a single 
African origin. 

The model best supported by the SFS analysis also suggests an early 
divergence of Australo-Papuans from the ancestors of all non-Africans, 
in agreement with two colonization waves across Asia. Under 
our best model, Australo-Papuans began to diverge from Eurasians 
~58 kya (95% CI 51-72 kya, Fig. 4, Supplementary Information section 
S07), whereas Europeans and East Asians diverged from each other 
~42 kya (95% CI 29-55 kya, Fig. 4, Supplementary Information 
section S07), in agreement with previous estimates. We find 
evidence for high levels of gene flow between the ancestors of Eurasians 
and Australo-Papuans, suggesting that, after the fragmentation of the 
OoA population (‘Ghost’ in Fig. 4) 57-58 kya, the groups remained in 
close geographical proximity for some time before Australo-Papuan 
ancestors dispersed eastwards. Furthermore, we find evidence for 
gene flow between sub-Saharan Africans and Western Eurasians after 
~42 kya, in agreement with previous findings. 

MSMC analyses suggest that the Yoruba/Australo-Papuans and the 
Yoruba/Eurasians cross-coalescence rates are distinct, implying that the 
Yoruba and Eurasian gene trees across the genome have, on average, 
more recent common ancestors (Extended Data Fig. 4c, Supplementary 
Information section S08). We show through simulations that these differences 
cannot be explained by typical amounts of archaic admixture 
(<20%, Extended Data Fig. 4d). Moreover, the expected difference in 
phasing quality among genomes is not sufficient to explain this pat- 
tern fully (Supplementary Information section S08). While a similar 
separation in cross coalescence rate curves is obtained when compar- 
ing Eurasians and Australo-Papuans with Dinka, we find that, when 
comparing Australo-Papuans and Eurasians with San, the cross coa- 
lescence curves overlap (Extended Data Fig. 4c). We also find that the 
inferred changes in effective population size through time of Aboriginal 
Australians, Papuans, and East Asians are very similar until around 
50 kya, including a deep bottleneck around 60 kya (Extended Data 
Fig. 6a). Taken together, these MSMC results are consistent with a split 
of both Australo-Papuans and Eurasians from a single African ancestral 
population, combined with gene flow between the ancestors of Yoruba 
or Dinka (but not San) and the ancestors of Eurasians that is not shared 
with Australo-Papuans. These results are qualitatively in line with the 
SFS-based analyses (see Fig. 4). While our results do not exclude the 
possibility of an earlier OoA expansion, they do indicate that any such 
event left little trace in the genomes of modern Australo-Papuans, 
in line with conclusions from related work appearing alongside this 
study. 


Genetic structure of Aboriginal Australians 

Uniparental haplogroup diversity in this dataset (Extended Data 
Table 1, Supplementary Information section S12) is consistent with 
previous studies of mitochondrial DNA (mtDNA) and Y chromosome 
variation in Australia and Oceania, including the presence of typically 
European, Southeast and East Asian lineages. The combined results 
provide important insights into the social structure of Aboriginal 
Australian societies. Aboriginal Australians exhibit greater between- 
group variation for mtDNA (16.8%) than for the Y chromosome 
(11.3%), in contrast to the pattern for most human populations. This 
result suggests higher levels of male- than female-mediated migration, 
and may reflect the complex marriage and post-marital residence 
patterns among Pama-Nyungan Australian groups. As expected 
(Supplementary Information section S02), the inferred European 
ancestry for the Y chromosome is much greater than that for mtDNA 
(31.8% versus 2.4%), reflecting male-biased European gene flow into 
Aboriginal Australian groups during the colonial era. 

On an autosomal level, we find that genetic relationships 
within Australia reflect geography, with a significant correlation 
(r_GEN,GEO = 0.77, P < 0.0005, Extended Data Fig. 7b) between the first 
two dimensions of an MDS analysis on masked genomes and geographical 
location (Supplementary Information section S13). Populations 
from the centre of the continent occupy genetically intermediate 
positions (Extended Data Fig. 7a, b). A similar result is observed 
with an FST-based tree for the masked genomic data (Extended Data 
Fig. 7c, Supplementary Information section S05) as well as in analyses 
of genetic affinity based on f3 statistics (Extended Data Fig. 2a), 
suggesting a population division between northeastern and southwestern 
groups. This structure is further supported by SFS analyses showing 
that populations from southwestern desert and northeastern regions 
diverged as early as ~31 kya (95% CI 10-32 kya, Fig. 3), followed by 
limited gene flow (estimated scaled migration rate (2Nm) ~0.01, 95% 
CI 0.00-11.25). An analysis of the major routes of gene flow within 
the continent supports a model in which the Australian interior acted 
as a barrier to migration. Using a model inspired by principles of electrical 
engineering where gene flow is represented as a current flowing 
through the Australian continent and using observed FST values as a 
proxy for resistance, we infer that gene flow occurred preferentially 
along the coasts of Australia (Extended Data Fig. 7e-g, Supplementary 
Information section S13). These findings are consistent with a model 
of expansion followed by population fragmentation when the extreme 
aridity in the interior of Australia formed barriers to population movements 
during the LGM. 

We used MSMC on autosomal data and mtDNA Bayesian skyline 
plots (BSP) to estimate changes in effective population size within 
Australia. The MSMC analyses provide evidence of a population 
expansion starting ~10 kya in the northeast, while both MSMC and 
BSP indicate a bottleneck in the southwestern desert populations 
taking place during the past ~10,000 years (Extended Data Fig. 6, 
Supplementary Information sections S08, S12). This is consistent with 
archaeological evidence for a population expansion associated with 
significant changes in socio-economic and subsistence strategies in 
Holocene Australia. 

European admixture almost certainly had not occurred before the 
late 18th century, but earlier East Asian and/or New Guinean gene 
flow into Australia could have taken place. We characterized the mode 
and tempo of gene flow into Aboriginal Australians using three different 
approaches (Supplementary Information sections S06, S07, S14). 
We used approximate Bayesian computation (ABC) to compare the 
observed mean and variance in the proportion of European, East Asian 
and Papuan admixture among Aboriginal Australian individuals to that 
computed from simulated datasets under various models of gene flow. 
We estimated European and East Asian admixture to have occurred 
approximately ten generations ago (Supplementary Information section 
S14), consistent with historical and ethnographic records. In line 
with this, a local ancestry approach suggests that European and East 
Asian admixture is more recent than Papuan admixture (Extended 
Data Fig. 8, Supplementary Information section S06). In addition, 
both ABC and SFS analyses indicate that the best-fitting model for 
the Aboriginal Australian-Papuan data is one of continuous but 
modest gene flow, mostly unidirectional from Papuans to Aboriginal 
Australians, and geographically restricted to northeast Aboriginal 
Australians (2Nm = 0.41, 95% CI 0.00-20.35, Fig. 3, Supplementary 
Information section S07). 

To investigate gene flow from New Guinea further, we conducted 
analyses on the Papuan ancestry tracts obtained from the local ancestry 
analysis. We inferred local ancestry as the result of admixture between 
four components: European, East Asian, Papuan and Aboriginal 
Australian (Supplementary Information section S06). The Papuan 
tract length distribution shows a clear geographic pattern (Extended 
Data Fig. 8b); we find a significant correlation of Papuan tract length 
variance with distance from WCD to other Aboriginal Australian 
groups (r=0.64, P< 0.0001). The prevalence of short ancestry tracts 
of Papuan origin, compared to longer tracts of East Asian and European 
origin, suggests that a large fraction of the Papuan gene flow is much 
older than that from Europe and Asia, consistent with the ABC analysis 
(Supplementary Information section S14). We also investigated 
possible South Asian (Indian-related) gene flow into Aboriginal 
Australians, as reported recently. However, we found no evidence of 
a component that can be uniquely assigned to Indian populations in 
the Aboriginal Australian gene pool using either admixture analyses 
or f3 and D-statistics (Supplementary Information section S05), even 
when including the original Aboriginal Australian genotype data from 
Arnhem Land. The different size and nature of the comparative datasets 
may account for this discrepancy. 


Pama-Nyungan languages and genetic structure 

To investigate whether linguistic relationships reflect genetic relationships 
among Aboriginal Australian populations, we inferred 
a Bayesian phylogenetic tree for the 28 different Pama-Nyungan 
languages represented in this sample (ref. 13; Extended Data Table 1, 
Supplementary Information section S15). The resulting linguistic 
and F_ST-based genetic trees (Extended Data Fig. 7c, d) share several 
well-supported partitions. For example, both trees indicate that the 
northeastern (CAI and WPA) and southwestern groups (ENY, NGA, 
WCD and WON) form two distinct clusters, while PIL, BDV and 
RIV are intermediate. A distance matrix between pairs of languages, 
computed from the language-based tree, is significantly correlated 
with geographic distances (r_GEO,LAN = 0.83, Mantel test two-tail P 
on 9,999 permutations = 0.0001, Supplementary Information section 
S13). This suggests that differentiation among Pama-Nyungan languages 
in Australia follows geographic patterns, as observed in other 
language families elsewhere in the world. Furthermore, we find a 
correlation between linguistics and genetics (r_GEN,LAN = 0.43, Mantel 
test P < 0.0005, Supplementary Information section S13) that remains 
significant when controlling for geography (r_GEN,LAN.GEO = 0.26, partial 
Mantel test P < 0.0005, Supplementary Information section S13). This is 
consistent with language differentiation after populations lost (genetic) 
contact with one another. The correlation between the linguistic and 
genetic trees is all the more notable given the difference in time scales: 
the Pama-Nyungan family is generally accepted to have diversified 
within the last 6,000 years, while the genetic estimates are two to 
five times that age. The linguistic tree thus cannot simply reflect initial 
population dispersals, but rather reflects a genetic structure that has a 
complex history, with initial differentiation 10-32 kya, localized population 
expansions (northeast) and bottlenecks (southwest) ~10 kya, 
and subsequent limited gene flow from the northeast to the southwest. 
The latter may be the genetic signature that tracks the divergence of the 
Pama-Nyungan language family. 
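
The Mantel tests reported above compare two distance matrices by permuting the rows and columns of one of them. The following is a minimal sketch of that idea, not the authors' pipeline: the function names, the use of Pearson correlation, the two-tailed counting rule and the seed are assumptions. With 9,999 permutations the smallest attainable P value under this counting rule is 1/10,000 = 0.0001.

```python
import numpy as np


def _upper(m):
    """Return the strict upper triangle of a square matrix as a vector."""
    m = np.asarray(m, dtype=float)
    return m[np.triu_indices_from(m, k=1)]


def mantel(a, b, permutations=9999, seed=0):
    """Two-tailed Mantel test between symmetric distance matrices a and b."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    rng = np.random.default_rng(seed)
    b_vec = _upper(b)
    r_obs = np.corrcoef(_upper(a), b_vec)[0, 1]
    hits = 0
    for _ in range(permutations):
        p = rng.permutation(a.shape[0])
        # Permute rows and columns of a together to keep it a valid matrix.
        r_perm = np.corrcoef(_upper(a[np.ix_(p, p)]), b_vec)[0, 1]
        if abs(r_perm) >= abs(r_obs):
            hits += 1
    return r_obs, (hits + 1) / (permutations + 1)


def partial_r(r_ab, r_ac, r_bc):
    """First-order partial correlation of a and b, controlling for c."""
    return (r_ab - r_ac * r_bc) / np.sqrt((1 - r_ac ** 2) * (1 - r_bc ** 2))
```

A partial Mantel test would obtain its P value by permutation as well; `partial_r` only shows the statistic that such a test evaluates.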


Selection in Aboriginal Australians 

To identify selection signatures specific to Aboriginal Australians, 
we used two different methods based on the identification of SNPs 
with high allele-frequency differences between Aboriginal Australians 
and other groups, similar to the population-branch statistics (PBS, 
Supplementary Information section S16). First, we scanned the 
Aboriginal Australian genomes for loci with unusually large changes 
in allele frequency since divergence from Papuans, taking recent 
admixture with Europeans and Asians into account ('global scan'). 
Second, we identified genomic regions showing high differentiation 
associated with different ecological regions within Australia ('local 
scan', Supplementary Information section S16). Among the top ranked 
peaks (Extended Data Table 2) we found genes associated with the 
thyroid system (NETO1, seventh peak in the global scan, and KCNJ2, 
first peak in the local scan) and serum urate levels (eighth peak in the 
global scan). Thyroid hormone levels are associated with 
Aboriginal-Australian-specific adaptations to desert cold and elevated serum 
urate levels with dehydration. These genes are therefore candidates 
for potential adaptation to life in the desert. However, further studies 
are needed to associate putative selected genetic variants with specific 
phenotypic adaptations in Aboriginal Australians. 
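
For orientation, the population-branch statistic that these scans are described as resembling can be sketched as follows. This is the commonly used PBS definition computed from pairwise F_ST values, not the authors' admixture-aware method; the function names and the example inputs are hypothetical.

```python
import math


def branch_length(fst):
    """Convert a pairwise F_ST value into a divergence time-like quantity."""
    return -math.log(1.0 - fst)


def pbs(fst_target_b, fst_target_c, fst_b_c):
    """Population-branch statistic for the target population.

    fst_target_b, fst_target_c: F_ST between the target and populations B, C.
    fst_b_c: F_ST between the two comparison populations.
    """
    return (branch_length(fst_target_b)
            + branch_length(fst_target_c)
            - branch_length(fst_b_c)) / 2.0


# Hypothetical SNP: strongly differentiated in the target population only.
example = pbs(0.5, 0.5, 0.0)
```

Large values indicate that allele-frequency change is concentrated on the branch leading to the target population, which is the kind of signal the global scan looks for.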


Discussion 

Australia has one of the longest histories of continuous human occu- 
pation outside Africa, raising questions of origins, relatedness to other 
populations, differentiation and adaptation. Our large-scale genomic 
data and analyses provide some answers but also raise new questions. 
We find that Aboriginal Australians and Eurasians share genomic 
signatures of an OoA dispersal—a common African ancestor, a bot- 
tleneck and a primary pulse of Neanderthal admixture. However, 
Aboriginal Australian population history diverged from that of other 
Eurasians shortly after the OoA event, and included private admixture 
with another archaic hominin. 

Our genetic-based time estimates are relative, and to obtain absolute 
dates we relied on two rescaling parameters: the human mutation rate 
and generation time (assumed to be 1.25 × 10^-8 per generation per site 
and 29 years, respectively, based on recent estimates). Although the 
absolute estimates we report would need to be revised if these param- 
eters were to change, the current values can be the starting point of 
future research and should be contextualized. 
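
The dependence of the absolute dates on the two rescaling parameters can be made concrete with a short sketch. It assumes that the relative estimates are expressed in mutation-scaled units (generations multiplied by the per-site mutation rate), as is usual for coalescent-based methods; that unit, the function names and the alternative rate in the example are assumptions of this sketch.

```python
MUTATION_RATE = 1.25e-8   # per generation per site, as assumed in the text
GENERATION_YEARS = 29.0   # years per generation, as assumed in the text


def scaled_time_to_years(t_scaled, mu=MUTATION_RATE, gen=GENERATION_YEARS):
    """Convert a mutation-scaled time (generations x mu) into years."""
    return t_scaled / mu * gen


def rescale_years(years, old_mu, new_mu, old_gen, new_gen):
    """Re-express an absolute date under different rescaling parameters."""
    return years * (old_mu / new_mu) * (new_gen / old_gen)


# Hypothetical example: a 37,000-year estimate under a doubled mutation rate.
revised = rescale_years(37_000, 1.25e-8, 2.5e-8, 29.0, 29.0)
```

Dates scale inversely with the mutation rate and linearly with the generation time, so any revision of either parameter shifts all absolute estimates by the same factor.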

We find a relatively old divergence between the ancestors of 
Pama-Nyungan speakers and Highland Papuans, only ~10% younger than 
the European–East Asian split time. With the assumed rescaling parameters 
this corresponds to ~37 kya (95% CI 25-40 kya), implying that 
the divergence between sampled Papuans and Aboriginal Australians 
is older than the disappearance of the land bridge between New Guinea 
and Australia ~7-14.5 kya, and thus suggests ancient genetic structure 
in Sahul. Such structure may be related to palaeo-environmental 
changes leading up to the LGM. Sedimentary studies show that the 
large Lake Carpentaria (500 × 250 km, Fig. 1) formed ~40 kya, when 
sea levels fell below the 53-m-deep Arafura Sill. Although Australia 
and New Guinea remained connected until the early Holocene, the 
flooding of the Carpentaria basin and its increasing salinity may have 
thus promoted population isolation. 

Our results imply that Aboriginal Australian groups are the descendants 
of the ancestral population that first colonized Australia. 
They also indicate that the population that diverged from Papuans 
~37 kya was ancestral to all Aboriginal Australian groups sampled 
in this study; yet, archaeological evidence shows that by 40-45 kya, 
humans were widespread within Australia (Fig. 1). Three non-exclusive 
scenarios could account for this observation: (1) the Aboriginal 
Australian ancestral population was widespread before the divergence 
from Papuans, maintaining gene flow across the continent; (2) it was 
deeply structured, and only one group survived to give rise to modern 
Aboriginal Australians; and (3) other groups survived, but the 
descendants are not represented in our sample. Additional genomes, 
especially from Tasmania and the non-Pama-Nyungan regions of the 
Northern Territory and Kimberley, as well as ancient genomes 
predating European contact in Australia and other expansions across 
Southeast Asia, may help resolve these questions in the future. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 

Extended Data Figure 1 | Per-individual admixture proportions of 
K = 7 ancestral components including Aboriginal Australians, New 
Guineans, Europeans, Africans, Melanesians and Polynesians. The 
genome of each individual is depicted as a bar and is coloured according 
to the estimated genome-wide proportions of ancestry components. An 
unrooted tree showing the relationships between the identified ancestral 
components is also estimated by our method. Each ancestry has been 
labelled with the name of the population (see also map) showing the 
highest fraction of that ancestral component. The cross-validation 
error is minimized for this value of K for fivefold cross-validation. The 
rooted tree supports the shared genetic origin of Aboriginal Australians, 
Papuans and Bougainvilleans. Note that only individuals with more 
than 50% of Aboriginal Australian ancestry in their genomes (defined in 
Supplementary Information section S06) were included in the analyses. 
Refer to ref. 58 and Supplementary Information section S05 for details 
about the method and the analysis. Map data ©2016 Google, INEGI. Tree 
constructed with http://jade-cheng.com/trees/ 

Extended Data Figure 2 | Genetic relationships of Aboriginal 
Australians and Papuans. a, Genetic affinities between a western 
central desert (WCD02) genome and Aboriginal Australians and 
Papuans. Outgroup f3 statistics between WCD02 and all other Aboriginal 
Australians and Highland Papuan individuals that were whole-genome 
sequenced for this study, using the genotypes called from the sequencing 
data. Because the widespread recent admixture in Aboriginal Australians 
has large confounding effects on the f3 statistics, the values were adjusted 
using the slope coefficient from a simple linear regression model fitted 
to the relationship between f3 and the fraction of non-indigenous (that 
is, neither Aboriginal Australian nor Papuan) ancestry in each individual 
genome. The adjusted f3 statistics display a genetic gradient that separates 
western and eastern Aboriginal Australian populations. However, we 
find no differences between Papuan population samples in their level of 
Aboriginal Australian affinity (Kruskal-Wallis test, P = 0.083). Horizontal 
lines correspond to ±1 standard error. b, Genetic affinities between a 
Papuan highlander genome and Aboriginal Australians and Papuans. 
The Papuan highlander sample MAR01 from the Marawaka area was 
arbitrarily chosen as a reference point for this analysis. f3 values were 
adjusted for recent admixture as in a. All Aboriginal Australian groups 
display a similar level of Highland Papuan affinity (with the exception 
of three outlier individuals from the north-eastern WPA and CAI 
populations: WPA06, WPAOS and CAI10, the latter two of which are 
known to have at least one parent with origins in Papua New Guinea or the 
Torres Strait Islands). While some differences between groups are actually 
statistically significant (Kruskal-Wallis test, P = 0.0002, after removing 
the three outliers), which could be consistent with, for example, low 
levels of Papuan gene flow into some Aboriginal Australian groups (see 
Supplementary Information sections S06 and S07), we caution that some 
of these differences are probably due to imperfect adjustment for Eurasian 
admixture (the adjusted f3 is highest in the WCD population, which has 
the least Eurasian admixture). Horizontal lines correspond to ±1 standard 
error. c, MSMC analyses. Linear interpolation through the midpoints of 
the time intervals of the relative cross coalescence rate estimates from 
MSMC using pairs of individuals including one HGDP-Papuan and one 
other individual as indicated. We used CAI01, PIL06, WCD01, WON03 
and an ECCAC sample for this analysis (see Supplementary Information 
section S08 for details). The MSMC results were scaled using a mutation 
rate of 1.25 × 10^-8 per generation per site as suggested in ref. 41 and a 
generation time of 29 years, corresponding to the average hunter-gatherer 
generation interval for males and females. 
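
The admixture adjustment described in panels a and b can be sketched as below. The caption states only that the slope of a simple linear regression of f3 on the non-indigenous ancestry fraction was used; subtracting slope times fraction from each value, so that every individual is projected to zero recent admixture, is an assumption of this sketch, as are the function names and the example values.

```python
def ols_slope(x, y):
    """Slope of the least-squares line y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx


def adjust_f3(f3_values, admixture_fraction):
    """Remove the linear effect of recent admixture from f3 statistics."""
    slope = ols_slope(admixture_fraction, f3_values)
    return [f - slope * a for f, a in zip(f3_values, admixture_fraction)]


# Hypothetical individuals whose f3 drops linearly with admixture.
fractions = [0.0, 0.1, 0.2, 0.4]
raw_f3 = [0.30 - 0.1 * a for a in fractions]
adjusted = adjust_f3(raw_f3, fractions)
```

In this toy case every adjusted value returns to the zero-admixture level, which is the behaviour the adjustment is meant to approximate in real data.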

Extended Data Figure 3 | Introgressed archaic sites and putative 
Denisovan and Neanderthal haplotypes. a, Distribution of number 
of putative introgressed sites per individual from archaic humans. 
The number of Neanderthal-specific introgressed sites per individual 
increases from Europe to Australia, and then decreases in Amerindians, 
which is consistent with recurrent Neanderthal (or Neanderthal-related 
archaic) gene flow during the expansion into Eurasia. Our results are 
thus indicative of several pulses of Neanderthal gene flow into modern 
humans, as inferred previously. We note, however, that the apparent 
high levels of Neanderthal-specific introgressed sites in Australo-Papuans 
can be explained by the expected number of misclassified Neanderthal 
introgressed sites resulting from the shared ancestry with Denisovans 
(see Supplementary Information section S11 for details). b-e, Putative 
Denisovan (PDH) and Neanderthal haplotypes (PNH). The putative 
haplotypes correspond to clusters (four or more SNPs spanning at least 
4 kb) of heterozygous or homozygous genotypes in complete linkage 
disequilibrium ('diplotypes') that are potentially the result of Neanderthal 
or Denisovan admixture. Those diplotypes are homozygous ancestral in 
10 Africans, homozygous derived in the Denisovan for the PDH 
(respectively Neanderthal for the PNH), homozygous ancestral in the 
Neanderthal for the PDH (respectively Denisovan for the PNH), and 
with the derived allele segregating in all other contemporary non-African 
humans (see Supplementary Information section S10 for details). We 
report the average number of PDHs and PNHs (b), the correlation 
between the estimated amount of Australo-Papuan ancestry (see Fig. 2a, 
Extended Data Fig. 1, Supplementary Information section S05) and the 
number of identified PDHs for each Australian sample (c), the sum of 
the lengths (d) and the average length (e) of the PDHs and PNHs per 
individual for worldwide populations included in our reference panel 
(see Supplementary Information section S04). 

Extended Data Figure 4 | Out of Africa: admixture graphs based on 
D-statistics and MSMC analyses. a, Admixture graphs representing 
some of the topologies considered for the two-wave and one-wave Out of 
Africa models assuming Denisovan admixture. All topologies are identical 
except for the coloured lineages representing Australo-Papuans (green), 
Neanderthal (Nea, orange) and Denisovan (Den, blue). The graphs differ 
in (1) the number of OoA events, and (2) the number of Neanderthal 
admixture pulses. Png, HGDP-Papuan. b, Sum of squared errors between 
the observed D-statistics and the expectations for each quartet in the 
graph involving the chimpanzee as an outgroup for each of the admixture 
graphs shown in a and the corresponding four without Denisovan 
admixture. Each point is the result of the optimization procedure with a 
different starting point. See Supplementary Information section S09 for 
details. c, Relative cross coalescence rate (CCR) estimates from MSMC 
for pairs of individuals including one African sample (Yoruba, Dinka and 
San) and one other, as indicated in the legend. d, Simulation study to assess 
the effect of archaic admixture on the CCR. Relative CCR estimated 
for data simulated under a simple two-population divergence model 
where one of the populations admixed at different rates with an archaic 
population. See Supplementary Information section S08 for details. 

Extended Data Figure 5 | Inferred deleterious mutations. a, Box plot 
of the number of derived homozygous sites per individual for worldwide 
populations that are predicted to be deleterious. Deleteriousness of SNPs 
was inferred using genomic evolutionary rate profiling (GERP) rejected 
substitution scores. Derived alleles with a rejected substitution score larger 
than 2 were considered to be deleterious (see Supplementary Information 
section S11). b, c, Average rejected substitution score per individual 
calculated across heterozygous sites (b), and derived homozygous sites (c). 
[...] one copy of the derived allele. Solid lines show the linear regression of 
homozygosity against average rejected substitution score per individual for 
non-African modern humans. Dashed lines indicate the 95% confidence 
interval for the linear regression. See Supplementary Information section S11 for 
details. 

Extended Data Figure 6 | Effective population size changes over time. 
a, Population size estimates from MSMC for pairs of individuals from 
several populations within and outside of Australia. For each run, we used 
two individuals from each population, that is, four haplotypes in each 
run. MSMC results were scaled as in Fig. 3. b, Bayesian skyline plots (BSP) 
calculated from the mtDNA genome sequences, showing the effective 
population size estimates over time when considering either groups 
from northeastern Australia (CAI, WPA) or groups from southwestern 
Australia (ENY, NGA, WCD, WON). Solid lines are the estimates, dashed 
lines are the corresponding 95% credible intervals (see Supplementary 
Information section S12). 

Extended Data Figure 7 | Genetics mirrors geography and languages. 
a, b, Procrustes analyses of the first two dimensions of a classical 
multidimensional scaling (MDS) analysis of the Aboriginal Australian 
genome sequences (autosomes). We considered two cases: an analysis 
including all variants (a), or only the variants remaining after genomic 
regions of putative recent European and East Asian origin are 'masked' 
(b, Supplementary Information section S06). Both MDS plots have been 
rotated towards the best overlap with geographic sampling locations as 
defined by Procrustes analysis. In each plot, the connecting lines indicate 
the error of the MDS coordinates towards the assigned 
population-sampling geographic coordinates. We find that the genetic relationships 
within Australia mirror geography, with a significant correlation for both 
cases, that is, r_GEN,GEO = 0.59, P < 0.0005 for all variants and even higher, 
r_GEN,GEO = 0.77, P < 0.0005, for the masked data. We find using the 
bearing correlogram approach that the main axis of genetic differentiation 
in the masked Aboriginal Australian genomes is at an angle of 65° 
compared to the equator, that is, in the southwest to northeast direction 
(Supplementary Information section S13). c, d, Correspondence between 
genetics and linguistics. c, Unrooted neighbour-joining F_ST-based genetic 
tree (cladogram). Weir and Cockerham F_ST distance was computed 
between the Aboriginal Australian populations after masking the Eurasian 
tracts. Statistical robustness of each branch was estimated by means of a 
bootstrap analysis (1,000 replicates, Supplementary Information section 
S05). d, Bayesian phylogenetic tree for the 28 different Pama-Nyungan 
languages represented in this sample (from ref. 13, see Supplementary 
Information section S15). Posterior probabilities are also indicated. Note 
that one language group can be shared by different Aboriginal Australian 
groups. The linguistic tree was built with BEAST. e-g, Gene flow across 
the continent. e, Mantel non-parametric r (estimating the goodness of 
fit between genetic differentiation and connectivity) versus ratios of 
resistance of inland to coastal nodes, showing a peak at 1.7. f, Best fit 
of pairwise population genetic differentiation, F_ST (computed between 
the nine Aboriginal Australian groups after masking Eurasian tracts 
(Supplementary Information section S06)), versus pairwise connectivity 
based on the environment (estimated as resistance) when moving inland 
is 1.7 times harder than moving along coastal nodes. g, Gene flow across 
the Australian landscape, quantified as the cumulative current for pairwise 
connections among Aboriginal Australian groups (black circles), with 
larger current (warmer colours) representing greater gene flow. 

Extended Data Figure 8 | European, East Asian and Papuan genomic 
tracts in Aboriginal Australians. a, Distribution of the tracts assigned to 
Aboriginal Australian (WCD), Papuan, East Asian or European ancestry 
for 58 unrelated non-WCD Aboriginal Australian samples. Most of the 
shorter tracts were of Papuan origin, suggesting that a large fraction of 
the Papuan gene flow is much older than that from Europe and East Asia, 
consistent with a Papuan influence spreading slowly from northeastern 
to southwestern Australia by ancient migration. b, Corresponding scatter 
plot with fitted line of per-individual variance in Papuan tract length 
versus geographic distance from WCD, the latter calculated using the 
great-circle distance formula for pairs of individual GPS coordinates. 
Papuan tract distribution showed a strong and significant correlation with 
distance from WCD (r = 0.64; P < 1 × 10^-4), with 'younger tracts' (that is, 
with a larger variance) closer to New Guinea and 'older tracts' (that is, with 
a smaller variance) closer to WCD. This is also consistent with continuous 
Papuan gene flow spreading from the northeast. 

Extended Data Table 1 | Whole genome sequence depth of coverage, haplogroup and language assignments for the Aboriginal Australian 
samples 

*The depth of coverage (DoC) is the average number of reads covering every position in the genome (hg19) after duplicate removal (see Supplementary Information section S03). 
†The average depth of coverage on the mitochondrial genome (mtDNA) is 3,484 ± 1,515 (mean ± s.d.) and haplogroups were called with haplogrep (http://haplogrep.uibk.ac.at/) and haplofind (https://haplofind.unibo.it/), see Supplementary Information section S12 for details and references. 
‡The average depth of coverage on the Y chromosome (Ychr) is 289 ± 4[...] (mean ± s.d.). Haplogroup assignment was performed with an in-house script that matched our SNPs with the classification provided in ISOGG v[...], see Supplementary Information section S12 for details and references. 
§Language group with which the [...] identifies, or to which they were assigned. Where more than one language is given, speakers either identified with more than one group, or they could not be assigned to a single group with certainty. 

Extended Data Table 2 | Selection scan in Aboriginal Australians 


Quantifying crater production and regolith overturn 
on the Moon with temporal imaging 


Emerson J. Speyerer, Reinhold Z. Povilaitis, Mark S. Robinson, Peter C. Thomas & Robert V. Wagner 


Random bombardment by comets, asteroids and associated 
fragments forms and alters the lunar regolith and other rocky surfaces. 
The accumulation of impact craters over time is of fundamental 
use in evaluating the relative ages of geologic units. Crater counts 
and radiometric ages from returned samples provide constraints 
with which to derive absolute model ages for unsampled units on 
the Moon and other Solar System objects. However, although 
studies of existing craters and returned samples offer insight into 
the process of crater formation and the past cratering rate, questions 
still remain about the present rate of crater production, the effect of 
early-stage jetting during impacts and the influence that distal ejecta 
have on the regolith. Here we use Lunar Reconnaissance Orbiter 
Camera (LROC) Narrow Angle Camera (NAC) temporal (‘before 
and after’) image pairs to quantify the contemporary rate of crater 
production on the Moon, to reveal previously unknown details of 
impact-induced jetting, and to identify a secondary impact process 
that is rapidly churning the regolith. From this temporal dataset, we 
detected 222 new impact craters and found 33 per cent more craters 
(with diameters of at least ten metres) than predicted by the standard 
Neukum production and chronology functions for the Moon. We 
identified broad reflectance zones associated with the new craters 
that we interpret as evidence of a surface-bound jetting process. We 
also observe a secondary cratering process that we estimate churns 
the top two centimetres of regolith on a timescale of 81,000 years, 
more than a hundred times faster than previous models estimated 
from meteoritic impacts (ten million years). 

Temporal imaging provides a means to directly identify, quantify 
and characterize new impact craters and other contemporary surface 
changes. Using metre-scale NAC observations, five new craters were 
identified by comparing NAC images to Apollo Panoramic images. 
Recent impact flashes observed from Earth aided in the discovery of 
new craters with diameters of 18 m and 34 m (refs 7-9). Building on 
these findings and the growing imaging time base, we investigated 
14,092 NAC temporal pairs (see Methods) covering 2.49 × 10^6 km^2 
(6.6% of the lunar surface) and detected changes that occurred between 
the observations (Fig. 1a-c). 

Figure 1 | Detection and distribution of new impact craters. 
a, b, Cropped example of a temporal pair (a, before image; b, after image) 
showing a 14.3-m impact crater (NAC frames M1104273380R and 
M1180855200R). c, Temporal ratio image created by dividing the after 
image (b) by the before image (a). d, Distribution of 222 new craters 
discovered with temporal pairs (yellow markers). The red markers 
identify the two craters located with impact flashes. The region shown 
in a-c is 225 m across. 

Figure 2 | Annual cumulative size-frequency distribution of newly 
formed craters discovered with NAC temporal pairs. Owing to the 
identification of more craters than the NPF predicts, the NPF model age of 
the surface covered by temporal pairs is estimated to be 1.3 ± 0.3 yr 
(fit not shown), or 1.2 ± 0.3 yr when excluding the largest crater (43 m) 
from the fit. For reference, a 1-yr isochron (grey line) and a fit using a 
Poisson calculation (black line) derived from the NPF are overlaid. The 
dashed vertical line represents the completeness limit for the current 
collection of temporal pairs. Error bars correspond to Poisson statistics. 

From this analysis, we discovered 222 newly formed impact craters 
with diameters ranging from 43 m down to the resolution limit of the 
image pairs (about four pixels) (Fig. 1d). Using the statistics of the new 
crater sizes and a normalized surface area, we derived an initial, annual 
size-frequency distribution of the craters to estimate the non-saturated, 
contemporary crater-production rate (Fig. 2). This rate is compared 
to estimates derived from the Neukum production function (NPF) 
and corresponding chronology function, which is a commonly used 
model for estimating the formation rate of craters as small as 10 m 
and is used to assign model age estimates to geologic units. Although 
pixel scale limits the measured crater distribution at small diameters 
(<7 m), we find 33% more craters with diameters of >10 m than the 
NPF predicts (16 yr^-1 in NAC temporal pairs versus 12 yr^-1 predicted 
by the NPF over the same surface area) and a model age derived 
from the NPF that is 20% older than predicted. Given the modelled 
production rate of new craters and our search area, there is only a 
16% chance of identifying 16 craters, which implies a possibly higher 
production rate for craters with diameters of >10 m. Furthermore, 
in the 10-20-m diameter range, the power-law slope (−b) of the 
cumulative crater frequency of our initial crater dataset is steeper 
than the NPF predicts (b_NAC = 4.64, 95% confidence interval (CI) of 
4.15-5.14 versus b_NPF = 2.97-3.15). The steeper slope was also present 
when analysing the differential crater frequency over the same diameter 
range. The tentatively steeper slope, which is also seen in extrapolated 
cratering models derived from the known orbital and size distribution 
of asteroids and comets, implies a higher crater-production rate for 
smaller impactors than the NPF estimates (see Methods). 
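
The 33% excess and the 16% chance quoted above can be checked with a short sketch. It assumes that crater counts follow a Poisson distribution with a mean of 12 (the NPF prediction) and that the quoted chance refers to observing 16 or more craters; both readings are assumptions of this sketch, and the function name is hypothetical.

```python
import math


def poisson_tail(k, mean):
    """Probability of observing k or more events for a Poisson mean."""
    cdf_below_k = sum(
        math.exp(-mean) * mean ** i / math.factorial(i) for i in range(k)
    )
    return 1.0 - cdf_below_k


observed_per_year = 16    # craters >10 m found in NAC temporal pairs
predicted_per_year = 12   # craters >10 m predicted by the NPF

excess = (observed_per_year - predicted_per_year) / predicted_per_year
chance = poisson_tail(observed_per_year, predicted_per_year)
```

Under these assumptions `excess` is one third and `chance` comes out between 15% and 16%, in line with the figures given in the text.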

Mapping of the individual impact sites reveals evidence of the 
formation process in the form of up to four distinct reflectance zones: 
proximal high reflectance zone (PHRZ), proximal low reflectance zone 
(PLRZ), distal high reflectance zone (DHRZ) and distal low reflectance 
zone (DLRZ). These reflectance zones are the result of ballistic and 
jetted material modifying surface properties. The four reflectance 
zones were first documented around an 18-m impact crater formed on 
17 March 2013 (event recorded by Earth-based assets). 

From our analysis, we found that the proximal zones fall within 
several crater diameters D from the crater centre. In the 58 cases 
for which a PHRZ was identified, it was typically the closest reflectance 
zone to the impact crater and modified the surface reflectance 
(relative to the pre-impact surface) by 12% ± 9%, and extended from 
the crater centre an average of 2.1D (median of 1.5D). The PLRZ, which 
was present in nearly all cases, exhibited a lower surface reflectance 
(−9% ± 4% on average) when compared to pre-existing terrain, and 
extended an average of 3.2D from the centre of the crater regardless of 
the presence of a PHRZ (median of 2.9D). 

Beyond the proximal zone(s), we always found at least one distal 
reflectance zone. The DLRZ (−2% to −6% reflectance change) is 
visible around 84% of the new impacts. In addition, 24% of the new 
craters (including the 17 March 2013 crater) have a DHRZ (1% to 4% 
higher reflectance). The DLRZ extends out from the proximal zone(s) 
in a ray-like pattern to a distance that is proportional to the area of 
the source crater (5.03D^[...]), whereas the DHRZ extends 0.52D^[...] from 
the centre of the crater, with D and the solution in units of metres (see 
Methods). In some cases, such as the 17 March 2013 impact, the DHRZ 
is consistently closer to the crater than the DLRZ; but we found that the 
order and placement vary from crater to crater (Fig. 3a, b). 

Both proximal zones around the 17 March crater are considered to 
contain the continuous ejecta blanket from the impact, with the albedo 
change depending on the relative optical maturity of the ejecta or 
changes to the local surface roughness. Our observations are consistent 
both with that interpretation and with the observation that only a 
subset of the impacts reached an optically immature layer to produce 
a PHRZ (Fig. 3c). The origins of the distal zones are not as clear. The 
DHRZ has been attributed to impact-induced jetting, and the DLRZ 
to sparse ballistic sedimentation from the later ejecta curtain churning 
the local regolith. Further analysis of the new craters reported 
here suggests that jetting may have played a dominant part in forming 
both distal zones. 

Jetting occurs early in the crater-formation process when the shock 
wave produces a mixture of melted and vaporized material that is 
ejected at low angles and at extremely high speeds that can exceed the 
original speed of the impactor. We agree with the assertions of 
ref. 7 that jetted vapour may smooth and redistribute surface grains, 
a process that could destroy the highly porous (up to 90%) structure 
in the upper few centimetres of regolith and form the DHRZ. In 
addition, we propose that jetting could locally increase the surface 
roughness, resulting in the DLRZ. Small topographic features (for 
example, crater rims) have 'shadows' in the distal zones (DHRZ 
and DLRZ), which indicates that these zones formed from an effect 
travelling parallel to and along the surface (Fig. 3d, e), rather than from 
a classic ballistic-sedimentation process. The pressure gradient between 
the jet and the vacuum of space causes the jet to expand and interact 
with the surface for tens to hundreds of crater diameters. To create 
the DLRZ, we propose that jetted melt and/or fine-grained regolith 
carried by the jetted vapour creates a series of impacts downrange that 
churn the upper several millimetres of regolith, which increases the 
surface roughness and creates macroscopic shadowing that reduces the 
reflectance. This localized churning caused by the jetting may also 
increase the pore spacing in the upper regolith and form the 
thermophysical cold spots observed around young impacts. 

We attribute the variation in the production of the two distal zones to 
the proportions of vapour and melt (or other clastic material) within 
the jet itself. It has been proposed that the initial and most highly 
shocked portion of the jet contains more vapour, whereas later-stage 
jetting tends to consist of more melt than vapour. Although the vapour 
portion of the jet may be able to effectively smooth the porous regolith 
similar to a blast zone, the increased mass of the melted material 
in the melt-dominated portion of the jet can collapse to the surface and 
Figure 3 | Signatures of crater formation in NAC temporal-pair ratios. 
a, Temporal ratio image showing rays of a DHRZ from a 19.5-m crater 
(ratio of NAC frames M1132496422R/M1101866147R). b, Temporal ratio 
image displaying a DLRZ extending 150D from a 12-m impact crater 
(ratio of NAC frames M1105837846R/M1121160416R). c, Temporal ratio 
image showing the distinct boundary between the two proximal and 
distal zones around an 11-m crater (ratio of NAC frames M1113630818R/ 
M1098309025L). The red dots in a-c mark the location of the crater. 
d, Enlarged version of b showing ‘shadows’ in the DLRZ caused by 
a small crater. e, NAC frame M1132496422R highlighting the DLRZ 
shadow shown in d with an arrow pointing to the parent impact. 


churn the upper few millimetres of regolith. A lack of a DHRZ around 
some craters may indicate that the jetted material was less powerful 
or carried more fines and/or melt. Furthermore, we see evidence that 
the impact angle and local topography affect the resulting albedo 
patterns (Fig. 3). 


Figure 4 | Examples and annual size-frequency distribution of 
splotches. a, b, Temporal pair (a, before image; b, after image) showing 
a low-reflectance splotch (indicated by the arrow in b) (NAC frames 
M190544405R and M1182416597L). c, d, Temporal pair (c, before image; 
d, after image) showing a high-reflectance splotch (indicated by the arrow 
In addition to surface modification adjacent to the new craters, 
NAC temporal pairs revealed over 47,000 other changes in surface 
reflectance. These changes in reflectance lacked rims or any other 
resolvable topographic signature. We interpret these localized 
reflectance changes to be the result of small primary or secondary impacts 
and refer to them as “splotches” (Fig. 4a-d). Most (90.7%) of the 
splotches exhibit lower reflectance (average −4%) than did the same 
region before the splotches formed (Extended Data Fig. 1), whereas 
7.4% show an increase in surface reflectance (average +10%) and 
the remaining 1.9% have mixed reflectance patterns. Many of the 
splotches are nearly circular with diameters ranging from 30 m down 
to the detection limit (about five pixels) (Fig. 4e). On the basis of the 
number of observed splotches, we estimate that 1.09 × 10⁵ splotches 
with diameters of >10 m form annually on the Moon and that they 
alone can alter 99% of the surface over 1.0 × 10⁷ yr. Using a power-law 
fit (see Methods) and extrapolating the results to 1 m yields an annual 
rate of 40.2 ± 1.6 splotches per square kilometre, with 99% coverage 
reached after 8.1 × 10⁴ yr. 
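
The extrapolation quoted above can be checked with a few lines of arithmetic. The sketch below assumes a cumulative power law of the form N(≥d) = N₁₀ (d / 10 m)^(−b), which the text does not write out explicitly, and uses the exponent b = 4.144 quoted in the Methods together with an assumed lunar surface area of 3.79 × 10⁷ km². The function and constant names are illustrative only.

```python
# Sketch: extrapolating the cumulative splotch production function.
# Assumed form (not written out in the text): N(>=d) = N10 * (d / 10 m)^(-b)

MOON_AREA_KM2 = 3.79e7  # assumed lunar surface area, 4*pi*R^2 with R ~ 1,737 km
B_SLOPE = 4.144         # cumulative power-law exponent quoted in the Methods
N10_PER_KM2 = 0.0029    # annual splotches per km^2 with diameter >= 10 m


def annual_density(d_m: float) -> float:
    """Annual number of splotches per km^2 with diameter >= d_m (metres)."""
    return N10_PER_KM2 * (d_m / 10.0) ** (-B_SLOPE)


def annual_global_count(d_m: float) -> float:
    """Annual number of splotches with diameter >= d_m over the whole Moon."""
    return annual_density(d_m) * MOON_AREA_KM2


if __name__ == "__main__":
    for d in (10.0, 5.0, 1.0):
        print(f"d >= {d:4.1f} m: {annual_density(d):.4g} per km^2, "
              f"{annual_global_count(d):.3g} globally")
```

Under these assumptions the 5-m and 1-m densities fall within the quoted uncertainties of 0.0510 ± 0.0018 and 40.2 ± 1.6 splotches per square kilometre.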

Although some of the splotches may contain unresolved primary 
impact craters (D < 4 pixels), we propose that a majority are the result 
of secondary impacts of poorly consolidated (or loose) regolith that 
only churn the target surface and do not form a classic impact 
crater. Localized groups of splotches observed around new craters 
are consistent with this interpretation. For example, the 17 March 
2013 impact site contained 248 splotches within 30.3 km of the parent 
crater (ref. 7). In addition, some splotches exhibit an asymmetric 
arrowhead shape or herringbone pattern that points back to new impact 
craters. 

Such secondary impacts of loose clusters of regolith create a 
hummocky and pitted surface surrounded by a subdued rim with a ratio of 
depth to diameter of 1:30 (ref. 25). This inner morphology is similar to 
the widespread “raindrop” texture described by the Apollo astronauts. 
Given such shallow excavation, most of the splotches would churn or 
garden only the upper few centimetres of regolith (mature zone), which 
would have similar optical maturity properties to the regolith at the 
surface. This shallow excavation is consistent with our observations, 
which indicate that high-reflectance splotches are typically present in 
areas where a thin layer of mature regolith might be expected (that is, 
on slopes of >15° and ejecta of Copernican-aged craters; Fig. 4c, d). 
Applying a conservative estimate of churning depth of 1:50 to the 
identified splotches, we predict that the top 20 cm of regolith will 
be altered over 1.0 × 10⁷ yr by >10-m splotches. Extrapolating the 
splotch population down to >1 m, our model predicts that only about 
8.1 × 10⁴ yr is required to rework and garden the upper 2 cm of regolith, 
which corresponds to a rate that is more than 100 times faster than 
previous models predicted from meteoritic impacts (10⁷ yr). This 
rapid churning is consistent with Apollo drive core samples, in which 
the top 10-50 cm of regolith is locally reworked and homogenized, 
and measurements of short-lived cosmogenic radionuclides (²⁶Al, 
half-life t₁/₂ = 7.3 × 10⁵ yr) that indicate that the upper 2-3 cm has been 
continuously reworked over a period of 10⁵-10⁶ yr (refs 28-30). 
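
As a rough check on the depths and the speed-up quoted in this paragraph, the following sketch applies the 1:50 depth-to-diameter ratio to the two limiting splotch diameters and compares the two timescales. The helper names are hypothetical, and the timescales are simply the values stated in the text.

```python
# Sketch: churn depth implied by a fixed depth-to-diameter ratio.
DEPTH_TO_DIAMETER = 1.0 / 50.0  # conservative ratio adopted in the text


def churn_depth_cm(diameter_m: float) -> float:
    """Depth (cm) churned by a splotch of the given diameter (m)."""
    return diameter_m * DEPTH_TO_DIAMETER * 100.0


def speedup(previous_timescale_yr: float, new_timescale_yr: float) -> float:
    """How many times faster the new timescale is than the previous one."""
    return previous_timescale_yr / new_timescale_yr


if __name__ == "__main__":
    print(churn_depth_cm(10.0))  # depth churned by a 10-m splotch, in cm
    print(churn_depth_cm(1.0))   # depth churned by a 1-m splotch, in cm
    print(speedup(1.0e7, 8.1e4))  # previous models versus this study
```

A 10-m splotch corresponds to a churn depth of 20 cm and a 1-m splotch to 2 cm, and the ratio of the two timescales is above 100, in line with the statement in the text.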

This rapid, localized, vertical churning affects the final thickness of 
the mature top layer of regolith, spurs volatile redistribution, accelerates 
the degradation of surface features and poses a minor hazard to future 
long-lived human and robotic surface assets. The impressive population 
of splotches compared to the number of new craters (47,000 versus 
about 220) stresses the importance of secondary impact processes 
on the lunar surface. We found that the 1-yr isochron derived from 
the NPF is within the estimated uncertainty of our contemporary 
crater-production rate, which is consistent with the notion that the 
cratering rate has been uniform over recent geologic time. However, 
the discovery, in the limits of the available data, of more craters (33% 
more with D> 10m) and a potentially steeper cumulative slope in 
the 10-20-m diameter range suggests that the cratering rate in this 
size range has not been fully characterized in previous models and 
might not account for crater degradation caused by splotches and other 
impacts. Further temporal observations will reduce the uncertainty 
estimates and better describe the current accumulation rate of craters 
with diameters of 100m and smaller. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 

METHODS 

LROC instrument. The LROC consists of a Wide Angle Camera (WAC) and twin 
NACs that provide multispectral and high-resolution imaging, respectively. The 
twin NACs, designated as NAC-Left (NAC-L) and NAC-Right (NAC-R), are 
capable of acquiring panchromatic images at 0.5 m per pixel from an altitude of 
50 km. A typical NAC image consists of 5,064 samples and 52,224 lines, resulting 
in over 500 megapixels of image data in each NAC-L and -R observation pair. 

More information on the cameras and images can be found in ref. 31 and at 
http://lroc.sese.asu.edu. As of 15 June 2015, the LROC NACs have acquired over 
one million images of illuminated terrain. From this image collection, 14,092 are 
observations of regions of the Moon where previous NAC observations with similar 
lighting geometry exist (Extended Data Fig. 2). These before and after image pairs, 
called ‘temporal pairs’ here, enable the search for changes to the lunar surface 
that occurred between the times when the first and second images were captured. 
NAC temporal imaging. For this analysis, we define a NAC temporal pair as a 
set of NAC observations of the same spatial area acquired under similar lighting 
conditions: an incidence angle of less than 50° to avoid shadows and a difference 
of less than 3° between the incidence angles of the before and after observations. 
The temporal gaps between observations span 176 to 1,241 Earth days (mean of 
490 days), owing to the orbital geometry of the spacecraft and the illumination of 
the Moon; the ground sampling distance ranged from 0.47 m to 1.75 m (Extended Data 
Fig. 3). 138 images have been manually scanned and 19 craters and 638 other 
surface changes were identified. However, manual scanning of the entire collection 
of temporal pairs is impractical. We systematically scanned more than 2.1 trillion 
pixels through a semi-automated process to cover more than 6.6% of the Moon. 

To develop an algorithm to identify surface changes, we used the collection of 
surface changes identified in ref. 32 as a baseline. 
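
The lighting criteria that define a temporal pair can be expressed as a simple predicate. The sketch below is illustrative only: the `Observation` type and its fields are hypothetical, and it checks only the two incidence-angle conditions stated above, not the requirement that the two footprints cover the same spatial area.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    """Hypothetical minimal record for one NAC observation."""
    image_id: str
    incidence_deg: float  # solar incidence angle in degrees


def is_temporal_pair(before: Observation, after: Observation,
                     max_incidence_deg: float = 50.0,
                     max_delta_deg: float = 3.0) -> bool:
    """True if both lighting criteria from the text are met.

    Spatial overlap of the two footprints is assumed to have been
    established separately.
    """
    both_well_lit = (before.incidence_deg < max_incidence_deg
                     and after.incidence_deg < max_incidence_deg)
    similar_lighting = abs(before.incidence_deg - after.incidence_deg) < max_delta_deg
    return both_well_lit and similar_lighting
```

For example, observations at incidence angles of 30° and 32° qualify, whereas 30° and 34° (difference too large) or 51° and 52° (too close to the terminator) do not.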

Temporal imaging provides a means to derive a rate of surface change without 
the concern of the changes saturating the surface. In the case of impact craters, 
temporal pairs can derive the unsaturated crater-production rate for small craters. 
For small impact craters, down to 10 m in diameter, the Neukum production 
function (NPF) and corresponding chronology function relied on crater statistics 
for the Cone and North Ray craters. Older surfaces of the Moon are eroded and 
saturated with craters with diameters of 1 km and smaller, making it impossible to 
derive an accurate production function in those regions. Therefore, the cratering 
rates derived from temporal pairs will ultimately help to approximate ages of 
very young (<50 Myr) features such as irregular mare patches and fresh lobate 
scarps on the Moon, and young terrain on other planetary bodies, with the aid of 
scaling laws. Additionally, temporal pairs reveal and monitor other surface changes 
(that is, splotches and mass-wasting events), surface effects of which cannot always 
be quantified using only post-image analysis. 
Automated change detection. Change Recognition using Images with Similar 
Phase (CRISP) is an automated change-detection program developed by the LROC 
team to identify surface changes using temporal observations. The program is split 
into three functions: image registration, pixel- and area-based change-detection 
filtering, and image segmentation. CRISP starts with the Planetary Data System 
(PDS) formatted and archived experiment data records (EDRs) for each NAC 
observation that constitutes the temporal pair. Before inspecting the images for 
changes, we apply a radiometric calibration to each image, which converts 
the digital numbers (DN) in the EDR image into reflectivity (irradiance/solar 
flux, I/F). We then rectify the images into a common reference frame using the 
spacecraft ephemeris, a temperature-dependent pointing model for each NAC, 
and a numerical camera model that defines the internal geometry of the sensor 
and optics. This procedure enables images to be aligned within 20 m without 
requiring any ground control points. Although sufficient for many cartographic 
purposes, a misregistration of 20 m (40 pixels acquired at an altitude of 50 km) is 
too coarse for small-scale change detection (Extended Data Fig. 4). 

To achieve proper registration (<1 pixel), such that each surface feature in the 
before image is matched to the same feature in the after image, the image pairs are 
registered using an automatic co-registration tool called coreg in the Integrated 
Software for Imagers and Spectrometers (ISIS) developed by the US Geological 
Survey Astrogeology Science Center. coreg is an area-based image-matching 
program that takes a pattern chip from one image and compares it to a similarly 
sized subregion in a larger search chip. For each pixel in the search chip, coreg 
computes a goodness-of-fit parameter to determine the coordinates of the 
best image match. Here we use a normalized 2D cross-correlation factor to 
quantitatively compare the pattern and search window chips. 
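
The area-based matching described above can be illustrated with a brute-force search. The following is a minimal NumPy sketch of the general technique, not the ISIS coreg implementation: it slides a pattern chip over every position of a search chip, scores each position with a normalized cross-correlation, and returns the best one. The function names are illustrative.

```python
import numpy as np


def ncc(a: np.ndarray, b: np.ndarray) -> float:
    """Normalized cross-correlation of two equally sized chips."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    # A chip with no texture has zero variance; treat it as uncorrelated.
    return float((a * b).sum() / denom) if denom > 0 else 0.0


def best_match(pattern: np.ndarray, search: np.ndarray):
    """Return (score, row, col) of the best pattern position in the search chip."""
    ph, pw = pattern.shape
    sh, sw = search.shape
    best = (-2.0, 0, 0)  # NCC lies in [-1, 1], so -2 is below any real score
    for r in range(sh - ph + 1):
        for c in range(sw - pw + 1):
            score = ncc(pattern, search[r:r + ph, c:c + pw])
            if score > best[0]:
                best = (score, r, c)
    return best
```

Because the score is normalized by the variance of both chips, it is insensitive to uniform brightness and contrast differences between the two images, which is the property that makes it suitable for images taken at different times.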

Once registered, we compute an image ratio by dividing the after image by the 
before image (Fig. 1a-c). Because the lighting and viewing geometries are similar 
and the images are registered within a pixel, the values in the ratio image are close 
to one except in cases where a change has occurred on the surface that affects the 
reflectance. This reflectance change can be caused by a new impact event, the 
emplacement of a splotch or a recent mass-wasting event. 
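
The ratio step can be sketched as follows. This is an illustrative NumPy example rather than the CRISP code: it assumes two co-registered reflectance arrays of equal shape, and it flags pixels whose ratio departs from one by at least 2%, the detection threshold quoted later in the Methods. The `eps` guard against division by zero is an added assumption.

```python
import numpy as np


def ratio_image(before: np.ndarray, after: np.ndarray, eps: float = 1e-12) -> np.ndarray:
    """After/before ratio of two co-registered reflectance images."""
    return after / np.maximum(before, eps)


def changed_pixels(ratio: np.ndarray, threshold: float = 0.02) -> np.ndarray:
    """Boolean mask of pixels whose ratio departs from unity by >= threshold."""
    return np.abs(ratio - 1.0) >= threshold
```

A pixel that darkens from a reflectance of 0.10 to 0.09 has a ratio of 0.9 and is flagged, whereas a 0.5% fluctuation is not.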
However, there are still some variations in the reflectance caused by natural 
features on the Moon. For example, the temporal ratio image can encounter 
sensor noise in shadowed portions of the image where the captured signal is low. 
Additionally, if there are slight differences in lighting or viewing geometries 
between the before and after images, then small photometric differences can cause 
disparities in the resultant temporal ratio image. To separate genuine surface changes 
from variations due to natural lighting and sensor noise, we create a second 
change-detection filter using a 
normalized 2D cross-correlation (NCC) factor for an m × m pixel area in the before 
image compared to the same-sized area in the same location of the after image. The 
values are similar to those computed using coreg, except that the pattern chip does 
not scan across the search chip. For an area with no detectable surface changes, the 
corresponding area in the NCC image contains values near one. When the m × m 
pixel NCC filter encounters a substantial texture difference between the before and 
after images, the recorded values in the NCC deviate from unity. 
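
A fixed-location NCC filter of this kind can be sketched in a few lines. The example below is a simplified illustration, not the CRISP implementation: it evaluates the NCC on non-overlapping m × m tiles rather than at every pixel, and the tile size m = 5 is an arbitrary choice for the example.

```python
import numpy as np


def ncc_map(before: np.ndarray, after: np.ndarray, m: int = 5) -> np.ndarray:
    """NCC between co-located m x m tiles of two co-registered images.

    Returns one value per tile; values near 1 indicate unchanged texture.
    Tiles with no texture (zero variance) are returned as NaN.
    """
    rows, cols = before.shape[0] // m, before.shape[1] // m
    out = np.full((rows, cols), np.nan)
    for i in range(rows):
        for j in range(cols):
            a = before[i * m:(i + 1) * m, j * m:(j + 1) * m].astype(float)
            b = after[i * m:(i + 1) * m, j * m:(j + 1) * m].astype(float)
            a = a - a.mean()
            b = b - b.mean()
            denom = np.sqrt((a * a).sum() * (b * b).sum())
            if denom > 0:
                out[i, j] = (a * b).sum() / denom
    return out
```

Unlike the registration step, the tile is not moved to find the best match, so a low value directly indicates that the texture at that location has changed.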

Together, the ratio image and NCC image are used to identify surface changes 
in the before and after image pairs. The threshold values were empirically derived 
using the surface changes identified through manual scanning of 138 NAC 
temporal pairs. For the change to be detected by the algorithm, the reflectance 
in one of the pixels covering the feature must change by at least 2% in the ratio image. 
For each detected change identified by CRISP, a 200 × 200 pixel cut-out is 
generated for the before image, the after image, the ratio image and the change 
mask. These full-resolution sub-images are combined into an animated GIF image 
with metadata associated with the temporal pair (for example, image names, 
location and temporal difference between observations). Additional metadata 
and information regarding the animated GIF are stored in a PostgreSQL database 
for continued analysis. 
Change classification. Detected changes are classified manually with an interface 
that blinks the sub-images (animated GIFs) and displays associated characteristics 
that the user can flag: low-reflectance change, high-reflectance change, visible 
crater rim, associated rays and indication of emplacement direction. On the basis of 
measurements collected during our study, a user can classify a batch of thumbnails 
at an average of one classification every 3.3 s. These user-supplied classification 
flags are integrated into the PostgreSQL database for later analysis. This process 
not only catalogues the surface changes, but also removes any of the false positives 
identified by CRISP. For a change to be classified as a splotch or new impact crater, 
it must be unambiguous. Therefore, we omit cataloguing some of the very small 
impacts (fewer than four pixels) and small splotches (fewer than about five pixels), 
owing to a lack of visual evidence in the temporal image pair. As a result, we are 
underestimating the population of small surface changes and of surface changes that 
do not create distinct contrast boundaries with their surroundings. 

In cases where a rim is identified, we measure and record the diameter of the 
crater. For an accurate diameter measurement, the crater must span at least four 
pixels. For our temporal dataset, for which the ground sampling distance ranged 
from 0.47 m to 1.75 m (Extended Data Fig. 3), the crater with the smallest diameter 
that could be visible in all temporal pairs is 7 m across (1.75 m per pixel × 4 pixels). 
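
The completeness limit follows directly from the pixel scale. The short sketch below, with a hypothetical helper name, applies the four-pixel criterion to the two ends of the ground-sampling-distance range.

```python
MIN_PIXELS_ACROSS = 4  # minimum crater width, in pixels, for a reliable diameter


def min_measurable_diameter_m(gsd_m_per_pixel: float) -> float:
    """Smallest crater diameter (m) measurable at a given ground sampling distance."""
    return gsd_m_per_pixel * MIN_PIXELS_ACROSS


if __name__ == "__main__":
    print(min_measurable_diameter_m(1.75))  # coarsest temporal pairs
    print(min_measurable_diameter_m(0.47))  # finest temporal pairs
```

At the coarsest sampling (1.75 m per pixel) the limit is 7 m, whereas the finest pairs (0.47 m per pixel) can in principle resolve craters just under 2 m across.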
Normalizing spatial coverage with respect to time. Because the current spacing 
between NAC temporal-pair observations varies between 176 and 1,241 days, we 
scaled the search area for each NAC temporal-pair observation by calculating the 
product of the actual area searched (A_i) and the temporal gap (Δt_i), and dividing 
it by a period of one year to maintain the units of area. The sum of this expression 
over all temporal pairs gives the total annual search area: 

$$A_{\mathrm{annual}} = \sum_i \frac{A_i \, \Delta t_i}{1\ \mathrm{yr}}$$

This technique, which was used in ref. 36 to derive the contemporary cratering rate 
on Mars, yielded an annual surface area of 3.31 × 10⁶ km², or 8.71% of the Moon 
observed by NAC temporal imaging (original non-scaled area of 2.49 × 10⁶ km² 
or 6.57% of the Moon). Using this normalized surface area, we derived cratering 
rates (Fig. 2) and a production function for splotches (Fig. 4). 
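
The normalization amounts to a time-weighted sum of footprint areas. A minimal sketch, assuming each temporal pair is described by a hypothetical (area in km², gap in days) tuple and a year of 365.25 days:

```python
DAYS_PER_YEAR = 365.25  # assumed length of the normalizing year


def annual_search_area(pairs) -> float:
    """Sum of area * (gap / 1 yr) over temporal pairs.

    `pairs` is an iterable of (area_km2, gap_days) tuples.
    """
    return sum(area_km2 * gap_days / DAYS_PER_YEAR for area_km2, gap_days in pairs)


if __name__ == "__main__":
    # Two made-up pairs: 100 km^2 observed one year apart and
    # 200 km^2 observed two years apart.
    print(annual_search_area([(100.0, 365.25), (200.0, 730.5)]))
```

The ratio of the scaled to the unscaled area quoted above, 3.31/2.49 ≈ 1.33, corresponds to an area-weighted mean gap of roughly 486 days, close to the 490-day mean spacing of the temporal pairs.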

Cratering rate. Using the crater statistics and the normalized surface area 
(3.305 × 10⁶ km² yr⁻¹), we evaluated the current crater-production rate (Fig. 2 
and Extended Data Fig. 5). To provide a basis for comparison, we evaluated our 
statistics with the NPF, which is a standard used to estimate model ages for geologic 
units using size-frequency statistics of craters with diameters between 10 m and 
300 km (ref. 2). The NPF predicts that 140 craters with diameters of >10 m form 
annually on the Moon. From the area covered by the NAC temporal pairs scaled 
to a 1-yr period, the NPF estimates that 12 craters (D > 10 m) should have been 
located. However, the 14,092 randomly targeted temporal pairs revealed 16 impact 
craters (D > 10 m). This total excludes the 18-m and 34-m craters discovered by 
targeted observation after observed impact flashes. Using a hypergeometric 
distribution that accounts for the area searched and the modelled crater-production 
rate, we found that there is only a 16% probability of finding 16 or more impact 
craters in the NAC temporal dataset. Therefore, on the basis of our tentative crater 
statistics, the current crater-production rate is potentially higher than the NPF 
predicts. 
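
The expected count and the tail probability can be approximated with elementary statistics. The sketch below deliberately substitutes a Poisson model for the hypergeometric distribution used in the text, which is a reasonable approximation when the searched fraction of the Moon is small; it should be read as a sanity check, not as a reproduction of the authors' calculation.

```python
import math


def poisson_tail(k_min: int, mean: float) -> float:
    """P(X >= k_min) for X ~ Poisson(mean)."""
    cdf = sum(math.exp(-mean) * mean ** k / math.factorial(k) for k in range(k_min))
    return 1.0 - cdf


NPF_ANNUAL_GLOBAL = 140    # craters with D > 10 m per year (NPF prediction)
SEARCHED_FRACTION = 0.0871  # time-normalized fraction of the Moon searched

expected = NPF_ANNUAL_GLOBAL * SEARCHED_FRACTION  # about 12 craters
p_16_or_more = poisson_tail(16, expected)

if __name__ == "__main__":
    print(f"expected craters: {expected:.1f}")
    print(f"P(>= 16 craters): {p_16_or_more:.2f}")
```

Under the Poisson approximation the probability of finding 16 or more craters comes out in the neighbourhood of the 16% quoted in the text.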

To see the effect of an increased cratering rate on an absolute model age, we 
fit the NPF to the 15 new impact craters discovered with diameters of 10-20 m 
and found an age that is 20% older than modelled when assessed with a Poisson 
timing analysis (measured age of 1.2 ± 0.3 yr). In this initial analysis, we omitted 
the largest crater (43 m) from our fitting, because craters in that size range are not 
common enough to form on an annual basis in the search area. The NPF predicts 
that 1-2 (1.56) craters with diameters of >43 m form annually on the entire 
Moon. Given the search area, there is only a 14% chance of finding a crater with a 
diameter of >43 m. Therefore, including the 43-m crater in our absolute model age 
derivation could falsely increase the model age. However, if we included the 43-m 
crater, then the measured model age increases to 1.3 ± 0.3 yr. As LROC acquires 
more temporal-pair observations throughout the rest of the extended mission, the 
increased spatial and crater frequency statistics collected will enable us to refine the 
size-frequency distribution of craters with diameters of up to 75-100 m. 

Using the same crater population (10 m < D < 20 m), we found that the 
cumulative power-law slope (−b) derived without binning the craters is potentially steeper 
than the NPF estimates (b = 4.62; 95% CI of 4.15-5.14 and 2.97-3.15, respectively). 
A fit to the differential size-frequency distribution of the craters also yielded a 
potentially steeper slope compared to the NPF (b_diff = 5.11; 95% CI of 2.27-6.94 
and 3.97-4.15, respectively). These findings imply that if the crater-production rate 
is higher in this diameter range, then the model ages derived with the NPF using 
small craters near the 10-m cut-off may contain a systematic offset. 

In addition to the NPF, there are other crater production functions derived from 
our current knowledge of the asteroid and comet population in the inner Solar 
System. First, the model production function (MPF) in ref. 37 with the Holsapple 
and Housen scaling laws applied predicts fewer craters forming in the size range 
examined here and a distribution with a similar cumulative slope to that of the 
NPF. Second, the crater size-frequency distribution in ref. 11 that is valid for 
crater diameters between 100 m and 1,000 km predicts that we should have located 
17 craters when we extrapolated this distribution down to craters with diameters 
of >10 m (we identified 16). Additionally, for the 10-20-m diameter range, the 
cumulative power-law slope ranges between 3.56 and 4.50 in the extrapolated 
model, which overlaps the confidence interval we derived from our dataset (4.62; 
95% CI of 4.15-5.14). However, this increased rate and steeper slope seen in the 
extrapolated model and data presented in ref. 11 contradict other models based 
on counts of terrestrial bolides and observations of shallower power-law slopes 
derived from new terrestrial bolides and recent Martian craters. This 
shallower slope in the Martian dataset might, however, be the result of rapid weathering of the 
blast zones observed around small Martian craters, which could make it difficult 
to identify the smallest new craters. Analysis of additional temporal image pairs 
will ultimately strengthen our crater statistics and will help to refine the rate of 
crater production and to validate cratering models. 

Characterizing impact sites. We found that each new impact crater discovered 
over the course of this investigation was surrounded by up to four distinct 
reflectance zones that are the result of the crater-formation process. One of the clearest 
examples of all four reflectance zones is around the impact crater that formed on 
17 March 2013 (ref. 7) (Extended Data Fig. 6). We identified the visible zones and 
measured the maximum distance spanned by each reflectance zone from the centre 
of each new crater (Extended Data Fig. 7 and Extended Data Table 1). Statistics 
regarding the span of the proximal zones are provided in Extended Data Table 2. 
Owing to the difficulty in measuring the size of small craters that are near the 
resolution limits of the observations, we omitted craters smaller than 3 m from 
the reflectance-zone analysis; the statistics regarding the 43-m crater are omitted 
in the figures, owing to a lack of adequate before imaging near the impact site. 
Modelling splotch accumulation. The varying spatial resolution of the temporal 
pairs causes a “roll-off” in the size-frequency distribution for splotches with 
diameters of <10 m (Fig. 4e). Using a least-squares power-law fit for splotches 
with diameters of 10 m (the diameter at which splotches are resolved at all pixel 
scales) to 40 m, we estimate that 0.0029 ± 0.0001 splotches with diameters of 
>10 m form over an area of 1 km² annually (Fig. 4e), which is equivalent to 
1.09 × 10⁵ splotches forming across the entire Moon annually. Using the power-law 
1,483 x 10", 95% Cl of (1.459-1.508) x 10-" b= 4.144, 95% CL 
of 4.148-4.141; d is the splotch diameter in metres; R² = 0.9991) and extrapolating 
the results to 5 m and to 1 m yields an annual rate of 0.0510 ± 0.0018 and 40.2 ± 1.6 
splotches per square kilometre, respectively, with an annual global accumulation of 
1.93 × 10⁶ and 1.52 × 10⁹, respectively. Using this splotch production function and 
a Monte Carlo model that randomly emplaces splotches, we estimate that 99% of 
the lunar surface would be altered after 1.0 × 10⁷ yr from splotches with diameters 
of >10 m. By combining the coverage rate and assuming a conservative churn 
depth with respect to splotch diameter, we can estimate regolith gardening 
rates from the emplacement of splotches on the surface. Using a ratio of depth 
to diameter of 1:50 for these features, which is a conservative estimate based on 
laboratory experiments of clustered impacts at similar speeds, we estimate that 
these events would effectively churn the upper 20 cm of regolith after 1.0 × 10⁷ yr. 
Extrapolating the rate of splotch formation down to 5 m and 1 m drastically reduces 
the timescales to 2.3 × 10⁶ yr and 8.1 × 10⁴ yr, respectively. Applying the same 
assumption regarding the ratio of depth to diameter to the latter simulation yields 
a regolith gardening rate of about 10⁵ yr for the upper 2 cm. 
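
The order of magnitude of these coverage times can be recovered without a Monte Carlo simulation. The sketch below is an analytic stand-in for the authors' model, under assumptions of my own: circular splotches placed at random (so the uncovered fraction decays as exp(−rate × t)), a cumulative power law with exponent b = 4.144, and no upper cut-off on splotch diameter. Integrating the area πd²/4 over the differential distribution gives an annual covered area of (π/4) · b/(b − 2) · N(≥d_min) · d_min².

```python
import math

B_SLOPE = 4.144  # cumulative power-law exponent quoted in the text


def coverage_rate_per_yr(n_cum_per_km2: float, d_min_m: float,
                         b: float = B_SLOPE) -> float:
    """Fraction of the surface covered per year by splotches >= d_min_m.

    n_cum_per_km2 is the annual cumulative density N(>= d_min_m) per km^2.
    """
    area_m2_per_km2 = (math.pi / 4.0) * (b / (b - 2.0)) * n_cum_per_km2 * d_min_m ** 2
    return area_m2_per_km2 / 1.0e6  # 1 km^2 = 1e6 m^2


def years_to_cover(fraction: float, rate_per_yr: float) -> float:
    """Time for randomly placed splotches to cover the given surface fraction."""
    return -math.log(1.0 - fraction) / rate_per_yr


if __name__ == "__main__":
    for d_min, density in ((10.0, 0.0029), (5.0, 0.0510), (1.0, 40.2)):
        t = years_to_cover(0.99, coverage_rate_per_yr(density, d_min))
        print(f"d >= {d_min:4.1f} m: 99% coverage after about {t:.2g} yr")
```

With the densities quoted in the text, this approximation gives times of the same order as the Monte Carlo results (about 10⁷ yr, 10⁶ yr and just under 10⁵ yr for the 10-m, 5-m and 1-m limits), although the two approaches are not expected to agree exactly.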
Extended Data Figure 1 | Reflectance changes caused by emplacement 
of splotches. Histogram of reflectance changes associated with 
low-reflectance (top; n = 18,756) and high-reflectance (bottom; n = 1,757) 
splotches. 

Extended Data Figure 2 | Distribution of NAC temporal pairs. Image footprints colour-coded by the number of days between the before and after 
observations (June 2009 to May 2015). The size of the image footprints is exaggerated for display clarity. 

Extended Data Figure 3 | Temporal-pair statistics. Left, distribution of the area covered by individual temporal pairs versus the temporal spacing 
between the corresponding observations. Right, histogram of the ground sampling distance for each after image of the NAC temporal pair. 

Extended Data Figure 4 | Effect of image registration errors. 
a-d, Examples of a temporal ratio image with decreasing pixel offsets 
(ratio of NAC frames M188678240LR/M1180548227LR): 10-pixel offset 
(7.8-m offset; a); 5-pixel offset (3.9-m offset; b); 3-pixel offset (2.3-m 
offset; c); and <1-pixel offset (<0.8-m offset; d). The larger offsets in 
a-c make the identification of the new impact crater impossible. 

Extended Data Figure 5 | R-plot of new crater population. Relative 
crater frequency R of the 222 new impact craters identified with temporal 
imaging. For reference, a 1-yr isochron (grey line) derived from the NPF 
is overlaid for diameters of >10 m. Error bars are estimated on the basis 
of Poisson statistics of counts. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Extended Data Figure 6 | Details of the impact process recorded in the temporal pair. Temporal ratio image of the 17 March 2013 impact site 
surrounded by four distinct reflectance zones (ratio of NAC frames M1129645568L/M1836897891). 

Extended Data Figure 7 | Range of reflectance zones associated with new impacts. Maximum zone distance versus crater diameter for each 
of the four reflectance zones observed around new impacts. Craters smaller than 3 m in diameter were excluded from the least-squares fit. 

Extended Data Table 1 | Least-squares fit to maximum zone distance 
compared to crater diameter 

Zone   Fit type          a (95% CI)             b (95% CI)             R² 
PHRZ   Linear (aD + b)   1.81 (1.50 to 2.11)    0 (fixed)              0.169 
PLRZ   Linear (aD + b)   3.52 (3.32 to 3.71)    0 (fixed)              0.747 
DHRZ   Power (aD^b)      0.52 (0.11 to 0.92)    2.46 (2.17 to 2.75)    0.963 
DLRZ   Power (aD^b)      5.03 (3.84 to 6.21)    1.95 (1.86 to 2.04)    0.981 

Fit information for the plots shown in Extended Data Fig. 7, where D is the diameter of the crater in metres. 

Extended Data Table 2 | Span of the proximal reflectance zones 

Zone   Mean    Median   Standard deviation   Minimum distance   Maximum distance 
PHRZ   2.11D   1.50D    1.46D                0.84D              8.56D 
PLRZ   3.16D   2.91D    1.10D                1.33D              9.85D 

Statistical summary of the span of each proximal reflectance zone. Each value has been normalized to the crater diameter D. 

Enhanced flexoelectric-like response in oxide semiconductors 

Jackeline Narvaez¹, Fabian Vasquez-Sancho¹ & Gustau Catalan¹ 


Flexoelectricity is a property of all dielectric materials whereby they 
polarize in response to deformation gradients such as those produced 
by bending. Although it is generally thought of as a property of 
dielectric insulators, insulation is not a formal requirement: in 
principle, semiconductors can also redistribute their free charge 
in response to strain gradients. Here we show that bending a 
semiconductor not only generates a flexoelectric-like response, 
but that this response can in fact be much larger than in insulators. 
By doping single crystals of wide-bandgap oxides to increase their 
conductivity, their effective flexoelectric coefficient was increased by 
orders of magnitude. This large response can be explained by a barrier- 
layer mechanism that remains important even at the macroscale, 
where conventional (insulator) flexoelectricity otherwise tends to be 
small. Our results open up the possibility of using semiconductors 
as active ingredients in electromechanical transducer applications. 

Unlike piezoelectricity (polarization induced by homogeneous 
deformations such as stretching or compressing), flexoelectricity 
(polarization induced by inhomogeneous strain such as bending) is allowed by 
symmetry in all materials and is therefore a more widespread property 
of solids. However, the charge density that can be generated by 
flexoelectricity is normally small, which limits its practical appeal. This has 
motivated a search for ways to enhance flexoelectricity, which has so far 
yielded two tried-and-tested strategies. The first involves exploiting the 
proportionality between flexoelectricity and permittivity: materials 
with high dielectric constants also have high flexoelectric coefficients. 
The second involves maximizing strain gradients by working at very 
small (nanoscopic) size scales, at which maximum achievable 
deformations are larger; put simply, a thin film can be bent more than a 
thick slab. The first strategy forces one to work with high-permittivity 
materials and the second limits the size range in which flexoelectricity 
is competitive. Achieving larger responses in a wider range of materials 
and, crucially, doing so at the macroscale are among the most pressing 
challenges in the field. 

Although bulk flexoelectricity has a theoretical limit that cannot 
be exceeded, there are caveats to this limitation. First, bending a 
material not only elicits polarization from bulk flexoelectricity, but also 
from surface piezoelectricity, which is caused by the strains on 
the opposite sides of a bent crystal (compression on the concave side, 
extension on the convex side). Because all surfaces are asymmetric (and 
therefore piezoelectric), surface piezoelectricity is, from the points of 
view of symmetry and functionality, as universal as bulk flexoelectricity, 
and is in fact regarded as an actual intrinsic component of the total 
flexoelectric response. 

The second caveat has to do with the permittivity itself, which is a
measure of the polarizability of a material. In a dielectric insulator, it
is given by how much (and how easily) positive and negative bound
charges can be separated. However, the effective polarizability of a
capacitor structure (that is, its capacitance) can be enhanced above the
dielectric limit if we also allow free charges to separate. The physics of
this phenomenon is described by the Maxwell-Wagner model and is
exploited within the capacitor industry to increase the storage density
of so-called 'barrier-layer capacitors'. The basic idea is depicted in
Fig. 1b: when an electric field is applied to a heterogeneous material
consisting of insulating barrier layers separated by a (semi)conducting
region, the conducting region responds by allowing its free charges to
move across the barrier layers, thus effectively behaving as an
intercalated electrode. In this scenario, only the thin barrier layers contribute
to the capacitance and, because capacitance is inversely proportional
to thickness, enormous capacitances and even increased piezoelectricity
can be achieved. As we show here, the barrier-layer mechanism
can elicit much more bending-induced charge from semiconductors
than from insulators.
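
The barrier-layer argument can be illustrated with a minimal series-capacitor sketch. The numbers below (relative permittivity, layer thicknesses) are hypothetical values chosen only for illustration, and the helper name `series_capacitance_per_area` is not from the paper; the sketch assumes the same permittivity in every layer.

```python
EPS0 = 8.854e-12  # vacuum permittivity, F/m


def series_capacitance_per_area(layers):
    """Capacitance per unit area (F/m^2) of dielectric layers stacked in series.

    layers: iterable of (relative_permittivity, thickness_in_metres).
    """
    inverse = sum(thickness / (EPS0 * eps_r) for eps_r, thickness in layers)
    return 1.0 / inverse


# Hypothetical numbers, for illustration only.
EPS_R = 1000.0     # same relative permittivity assumed for every layer
T_TOTAL = 460e-6   # total crystal thickness (m)
T_BARRIER = 1e-6   # thickness of each interfacial barrier layer (m)

# Insulating crystal: interface, bulk and interface all act as dielectrics.
insulator = [
    (EPS_R, T_BARRIER),
    (EPS_R, T_TOTAL - 2 * T_BARRIER),
    (EPS_R, T_BARRIER),
]
# Conducting bulk: it acts as an intercalated electrode, so only the two
# barrier layers remain in the series stack.
barrier_layer = [(EPS_R, T_BARRIER), (EPS_R, T_BARRIER)]

c_insulator = series_capacitance_per_area(insulator)
c_barrier = series_capacitance_per_area(barrier_layer)
enhancement = c_barrier / c_insulator
print(f"capacitance enhancement: {enhancement:.0f}x")
```

With equal permittivities the enhancement reduces to the total thickness divided by the combined barrier thickness, which is why thin barriers on a thick crystal give a large effect.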

To explore this idea, we studied single crystals of BaTiO3 doped with
oxygen vacancies to increase their conductivity. Henceforth, as-received
BaTiO3 is labelled BTO; oxygen-depleted BaTiO3−δ (made by
vacuum annealing, see Methods) is labelled BTO−δ; and subsequently
re-oxidized BaTiO3, made by re-annealing in an oxygen atmosphere
(see Methods), is referred to as BTO+δ. Fully oxidized BTO is an


[Figure 1 panel labels: a, BaTiO3; b, BaTiO3−δ; each panel is layered as interface, bulk, interface.]


Figure 1 | Barrier layer model. a, A dielectric responds to voltage as 
three capacitors in series, with one bulk and two interfacial layers. The 
same is true for its electromechanical response to bending, with polar 
contributions (sketched as green arrows) from bulk flexoelectricity and 
surface piezoelectricity. b, A semiconductor can form depletion layers at 
the interfaces with the electrodes; in these circumstances, the conducting 
bulk acts as an intercalated electrode (blue layer) between interfacial 
barrier layers that respond as thin capacitors. This results in larger 
capacitance (inversely proportional to barrier thickness) and enhanced 
surface piezoelectricity, owing to screening of the internal depolarizing 
field by free charges (sketched as blue and red spheres) 
Figure 2 | Capacitance of BTO. a, b, Capacitance per unit area, C/A,
(a) and dielectric loss tangent (b) as a function of temperature for BTO
(black), BTO−δ (red) and BTO+δ (blue). The inset in a shows plan-view
photographs of the reduced (top) and oxidized (bottom) crystals. The
reduced crystal has a darker colour owing to increased light absorption by
free carriers (the even darker regions at the edges are shadow effects from
the illumination; the edges become chipped as a result of thermal stress
during annealing).


archetypal ferroelectric with high permittivity and flexoelectricity.
In terms of conductivity, it is a wide-bandgap (3.5 eV) material with
reasonably good dielectric insulation. However, when doped with
oxygen vacancies, BTO becomes an n-type electronic semiconductor
with charge-depleted surfaces. BTO−δ is also a good ionic conductor
at high temperature, with the vacancies acting simultaneously as
electron donors and as ionic charge carriers. This is relevant because
ionic defects such as vacancies can also respond to strain gradients via
the Vegard effect; such a 'flexoionic' effect would be the inverse of the
electrochemical strain observed in ionomers, which can be quantitatively
comparable to bulk flexoelectricity in mixed ionic-electronic
conductors. However, at the relatively low temperatures (by ionic
standards) of the present work, the ionic contribution to the conductivity
was found to be rather small (see Methods); nevertheless, to exclude the
role of ions, we have also studied the response of pure and Nb-doped
TiO2, the latter being a vacancy-free electronic semiconductor at room
temperature.

The capacitance density and dielectric loss of the BTO crystals are
shown in Fig. 2. BTO−δ shows classic signatures of Maxwell-Wagner
behaviour: high dissipation (loss tangent), consistent with increased
conductivity, and increased capacitance, consistent with the existence
of thin insulating barrier layers. Because the ratio between
the capacitance of BTO−δ and BTO is proportional to the thickness
ratio between the bulk and the interfacial barrier layers, we can estimate
an upper bound of about 1 μm for the thickness of the barrier layer
(see Methods).

The effective flexoelectric coefficient μ^eff is defined as the change in
polarization (measured as charge per unit area collected at the electrodes)
divided by applied strain gradient (beam curvature), and is plotted as a
function of temperature in Fig. 3; the measurement set-up (see Methods)
is as previously used for other insulating crystals. The results show that
the effective flexoelectricity of BTO−δ is two orders of magnitude larger
than for insulating BTO. This enhancement persists at temperatures
above those at which the ferroelectric phase (where macroscopic piezoelectricity
can yield spurious enhancements) is stable, and also above those at
which polar nanoregions exist, so it cannot be attributed to residual
Figure 3 | Effective flexoelectricity of BTO. The transverse effective
flexoelectric coefficient (μ_13^eff; charge density collected at the electrodes
divided by applied strain gradient) is plotted as a function of temperature
for reduced (conducting) BTO−δ and oxidized (insulating) BTO and
BTO+δ. The conducting sample shows a two-order-of-magnitude
enhancement, reaching into the millicoulomb per metre range.


ferroelectricity either. The intrinsic properties of the as-received (BTO) 
sample can be recovered upon reoxidation (BTO+δ), so the enhancement
is reversible and clearly caused by the doping. 

The effective flexoelectric coefficient of BTO−δ reaches values close
to 1 mC m⁻¹, the largest so far reported for any material. We argue
that the origin of this large response is not bulk flexoelectricity, but
its close relative, surface piezoelectricity. As mentioned earlier, in an
insulator, these two effects are regarded as inseparable components of
the total response. However, in a semiconductor with insulating
barrier layers such as that depicted in Fig. 1, there are additional
considerations: (I) the bulk polarization can be screened by free charges,
so bulk flexoelectricity should no longer contribute to the total
polarization of the capacitor structure, and this allows us to separate
out the surface contribution; and (II) bulk free charges can also screen
the depolarization field that would otherwise appear at the interface
between the barrier layer and the bulk.

In effect, such a system should behave as a bimorph transducer consisting
of two piezoelectric barrier layers attached to the opposite surfaces
of a conducting slab that acts as an intercalated electrode. The
phenomenology is somewhat reminiscent of Haertling's 'rainbow' transducers,
which use asymmetric annealing to turn one side of a piezoelectric
material into an electrode. Our crystals are instead symmetric, but there are
more fundamental differences: (I) although we have annealed BTO to
make it semiconducting, the barrier-layer mechanism can, in principle,
appear in any semiconductor (doped or intrinsic) as long as it has
high-resistivity interfaces such as Schottky barriers; and (II) the starting
material does not need to be piezoelectric because the polarization comes
from the surface, which is always piezoelectric. These two points are
illustrated by additional measurements performed on the centrosymmetric
semiconductor Nb-doped TiO2 (rutile), which displays a 1,000-fold
flexoelectric enhancement compared to undoped, insulating TiO2 (Fig. 4).

In the proposed scenario, the bending-induced charge density is the
product of the surface's transverse piezoelectric constant (e_31^surf) and the
surface strain, while the surface strain is the product of the curvature
(G) and the half-thickness of the crystal (t/2). Hence, the bending-induced
surface polarization is P = e_31^surf G t/2 and the effective
flexoelectric coefficient (defined as polarization P divided by curvature G) is

\mu_{13}^{\mathrm{eff}} = e_{31}^{\mathrm{surf}}\,\frac{t}{2} \qquad (1)

It is also possible to arrive at this equation by starting from the general
expression for the surface-piezoelectric contribution to the effective
flexoelectricity of a dielectric
Figure 4 | Effective flexoelectricity of TiO2 and 0.05% Nb-doped TiO2.
For two crystals of identical thickness, bending the semiconductor
(Nb-doped TiO2; filled red circles) generates three orders of magnitude
more charge than does bending the insulator (TiO2; filled black circle).
The effective flexoelectric coefficient (μ_13^eff) of the semiconductor is
proportional to sample thickness, consistent with a barrier-layer model.
(where t_λ, t_b are the thicknesses of the interfacial and bulk layers,
respectively, and ε_λ, ε_b are their dielectric constants), assuming t_b ≫ t_λ
and taking the limit as the bulk dielectric constant tends to infinity,
which is physically equivalent to stating that the bulk can perfectly
screen any depolarizing field. The role of our conductive bulk thus
becomes clear: it not only isolates the contribution from surface
piezoelectricity (itself a hitherto unresolved problem), but it also
enhances this contribution by screening the polar discontinuity at the
interface between the piezoelectric barrier layer and the bulk, thus
allowing the surface to reach polarization values that would be
unattainable in a finite-permittivity insulator. We can also use Equation (1)
to calculate the size of the surface piezoelectric coefficient (see
Methods). For BTO−δ, we found the surface's transverse piezoelectric
charge coefficient (piezoelectric constant e multiplied by elastic
compliance), d_31^surf, to range from 37 pC N⁻¹ at the Curie temperature
peak, T_C = 125 °C, to 0.6 pC N⁻¹ (similar to the coefficient of quartz)
at 300 °C; for comparison, the transverse piezoelectric coefficient of
BaTiO3 at room temperature is 36.5 pC N⁻¹ (ref. 30). The piezoelectric
coefficients of surface layers are therefore comparable to the bulk
piezoelectric coefficients of standard piezoelectrics and do not need to
be 'giant' to yield a large flexoelectric-like response.

According to the barrier-layer model, and as shown by the results in
Fig. 4, the effective flexoelectric coefficient of semiconductors is
proportional to the thickness, so it cannot be considered a material
constant. Instead, the material constant that dictates the
flexoelectric-like response of the device is the surface piezoelectric coefficient.
In macroscopically thick samples, this surface contribution can not
only quantitatively match bulk flexoelectricity, as predicted for
insulators, but for semiconductors it can surpass it. Thus, the thickness
proportionality of the effective flexoelectric coefficient means that the 
electromechanical response is large not only at the nanoscale, at which 
conventional (insulator) flexoelectricity is already important, but also 
at the macroscale. The magnitude of the effect, its persistence at the 
macroscale, and the ubiquity (and, generally, lead-free composition) 
of semiconducting materials are all positive news for device design 
and will hopefully stimulate new research into semiconductor-based 
flexoelectric and electromechanical applications. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper, references unique to 
these sections appear only in the online paper. 
METHODS
Materials. The samples used in this work were as follows. (I) Single crystals of
(001)-oriented BTO, commercially acquired from SurfaceNet, with dimensions
of 1 mm × 10 mm × 0.46 mm (width × length × thickness). Reduction of the
BTO−δ sample was achieved by annealing for 2 h at a temperature of 900 °C inside
a vacuum chamber at a base pressure of 10~® mbar. The sample was heated at
10 °C min⁻¹ up to the annealing temperature; after annealing it was quenched
back to room temperature in vacuum by turning off the heater, resulting in an
exponential cooling rate with a time constant of 15 min. The re-oxidized sample
(BTO+δ) was made by re-annealing at 800 °C for 30 h in a pure (99.9999%) O2
atmosphere. (II) Undoped and 0.05% Nb-doped (by weight) single crystals of rutile
TiO2 were acquired from Shinkosha (http://www-shinkosha.com) and polished by
us to different thicknesses. The dimensions were 20 mm × 2 mm × 0.175-0.53 mm
(length × width × thickness).
Flexoelectric and dielectric measurements. The method for measuring the
flexoelectric coefficient is described in ref. 26: a dynamic mechanical analyser
(Perkin-Elmer DMA-8000) was used to deliver an oscillatory three-point bending
stress with 2 μm of vertical displacement to beam-shaped single-crystal samples
placed between two supporting edges separated by 8 mm. The bending-induced
displacement currents were measured using a lock-in amplifier (Stanford Research
SR830 DSP) and converted to charge density by dividing the measured current by
the frequency of the applied mechanical stress and by the area of the electrodes.
The frequency of the oscillatory force in our dynamic mechanical analyser (DMA)
was set to 13 Hz; a prime number was chosen to avoid ringing interference from
the main power lines. The capacitance and dielectric loss were measured at 1 kHz
with an Agilent LCR-meter model E4980A.
Calculation of surface piezoelectricity. By inverting Equation (1) we isolate the
transverse surface piezoelectric coefficient: e_31^surf = 2μ_13^eff/t, where t is the thickness
of the crystal and μ_13^eff is the experimentally measured effective flexoelectric
coefficient of the semiconducting crystals (Fig. 3). Please note that Equation (1) is only
valid if bulk flexoelectricity is internally screened by the conductivity of the crystal.
Here e_31^surf is the strain-polarization piezoelectric coefficient of the surface,
which can be converted into the more familiar force-charge piezoelectric
coefficient (d_ij, with tensorial subindices i, j indicating respectively the type of strain in
Voigt notation and the direction of polarization in Cartesian coordinates) by
multiplying by the in-plane elastic compliance, s_11 (ref. 31):

d_{31}^{\mathrm{surf}} = e_{31}^{\mathrm{surf}}\, s_{11} \qquad (2)

Using the results in Fig. 3 and the in-plane elastic compliance s_11 = 8.5 × 10⁻¹² Pa⁻¹
for barium titanate, we obtain the surface piezoelectric coefficient of BTO−δ,
which ranges from a peak value of 37 pC N⁻¹ at T_C = 125 °C to 0.6 pC N⁻¹ at
300 °C. This surface piezoelectricity is measured above the Curie temperature
of BTO and so cannot be due to bulk piezoelectricity.
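
As a rough numerical check of this conversion, the following sketch inverts Equation (1) and applies Equation (2). It assumes a peak effective flexoelectric coefficient of 1 mC m⁻¹ (the "close to 1 mC m⁻¹" figure quoted in the main text, used here as a round number), the 0.46 mm crystal thickness from Materials, and an elastic compliance of 8.5 × 10⁻¹² Pa⁻¹; the function name is illustrative only.

```python
def surface_piezo_coefficient(mu_eff, thickness, s11):
    """Return (e_surf in C/m^2, d_surf in C/N) from an effective flexoelectric
    coefficient mu_eff (C/m), crystal thickness (m) and compliance s11 (1/Pa)."""
    e_surf = 2.0 * mu_eff / thickness  # Equation (1) inverted
    d_surf = e_surf * s11              # Equation (2)
    return e_surf, d_surf


# Assumed inputs: round-number peak response, thickness from Materials,
# and the in-plane compliance of barium titanate.
e_surf, d_surf = surface_piezo_coefficient(
    mu_eff=1.0e-3, thickness=0.46e-3, s11=8.5e-12
)
print(f"e_surf = {e_surf:.2f} C/m^2, d_surf = {d_surf * 1e12:.0f} pC/N")
```

Under these assumptions the result lands at the quoted peak value of about 37 pC N⁻¹, which supports reading the surface coefficient as comparable to ordinary bulk piezoelectric coefficients.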

Thickness dependence of the effective flexoelectric coefficient. Although we did
not manage to measure BTO crystals of different thicknesses (our thinned-down
samples snapped when going through the ferroelectric-ferroelastic transition), we
were able to measure Nb-doped TiO2 of different thicknesses. The results are
shown in Extended Data Fig. 1. The effective flexoelectric coefficient grows in
proportion to the crystal thickness, which rules out a bulk mechanism (either
flexoelectricity or inverse-Vegard electrochemical strain). More data are required
for accurate quantification of the linearity; for our limited sample set, the effective
flexoelectric coefficient appears to grow at a rate faster than linear. Nevertheless,
a linear regression to the available results (red line in Extended Data Fig. 1) yields
a slope that can be introduced into Equation (2) to yield the surface piezoelectric
coefficient. The result is d_31^surf = 16 pC N⁻¹, a reasonable value that is between
the piezoelectric coefficients of BTO and quartz.

Thickness of the skin layer. Using a series capacitor model, we estimate an upper
bound for the skin layer thickness. In the barrier-layer capacitor model, the
capacitance of the conductive sample corresponds to the interfacial barrier layers, with
the conductive core of the crystal acting functionally as an intercalated electrode.
Therefore, the capacitance of the conductive crystal is

C_{\mathrm{BTO}-\delta} = \frac{1}{2}\,\frac{\varepsilon_{\lambda} A}{t_{\lambda}}

where A is the electrode area, t_λ is the thickness of the interfacial barrier layer, and
ε_λ is its dielectric constant; the factor of 1/2 comes from assuming that there are
two identical interfacial barrier layers with their capacitances added in series (that
is, reciprocally).

The capacitance of the fully oxidized (dielectric) sample is given by
1/C_BTO = 2/C_λ + 1/C_b, where C_b, C_λ are the capacitances of the bulk and of each
of the two interfacial layers. If we assume that the capacitance of the interface is much larger
than that of the bulk (a reasonable assumption given that capacitance
is inversely proportional to thickness and the interfacial layers are expected to be
much thinner than the bulk), then we can simplify this expression by neglecting
the interfacial contribution, so that C_BTO = ε_b A/t_b, where ε_b and t_b are respectively
the dielectric constant of the bulk and its thickness. Therefore, the ratio between
the capacitance of the conducting crystal and that of the insulating crystal is
proportional to the thickness ratio between the bulk and the interfacial barrier layers:

\frac{C_{\mathrm{BTO}-\delta}}{C_{\mathrm{BTO}}} = \frac{1}{2}\,\frac{\varepsilon_{\lambda}}{\varepsilon_{b}}\,\frac{t_{b}}{t_{\lambda}}

where t_λ,b and ε_λ,b are the thicknesses and relative dielectric constants, respectively,
of the interface ('λ') and the bulk ('b'). The crystal has a thickness of 460 μm, and
the effective capacitance of the conducting crystal is approximately 200 times larger
than that of the insulating crystal (see Fig. 2); therefore, the thickness of each
interfacial barrier can be, at most, t_λ = 1/2 × 460/200 = 1.15 μm. This estimate assumes that
the dielectric constant of the barrier layer is similar to that of the bulk. If the local
dielectric constant of the interfacial barrier layer was smaller than that of the bulk
(which would be unusual in ferroelectrics, in which the interface behaves as a dead
layer), then the result must be corrected by a factor of ε_interface/ε_bulk. Hence, about 1 μm
is an upper bound, with the real thickness of the barrier layer likely to be smaller.
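
The upper-bound estimate can be reproduced with a short sketch. It rearranges the series-capacitor ratio for the barrier thickness, approximating the bulk thickness by the full crystal thickness; the function and parameter names are illustrative, and `eps_ratio` stands for the interface-to-bulk permittivity ratio (taken as 1 in the text's estimate).

```python
def barrier_thickness_upper_bound(total_thickness, capacitance_ratio, eps_ratio=1.0):
    """Barrier thickness from C_conducting / C_insulating.

    Uses ratio = (1/2) * eps_ratio * (t_bulk / t_barrier), with t_bulk
    approximated by the total crystal thickness. The result has the same
    length unit as total_thickness.
    """
    return 0.5 * eps_ratio * total_thickness / capacitance_ratio


# Values quoted in the text: 460 um thick crystal, roughly 200-fold capacitance gain.
t_equal_eps = barrier_thickness_upper_bound(460.0, 200.0)
# Hypothetical case: interfacial permittivity half that of the bulk.
t_half_eps = barrier_thickness_upper_bound(460.0, 200.0, eps_ratio=0.5)
print(f"{t_equal_eps:.2f} um (equal permittivity), {t_half_eps:.3f} um (half)")
```

A lower interfacial permittivity only shrinks the estimate, which is why the equal-permittivity value serves as an upper bound.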
Re-oxidation kinetics and oxygen content. Although sample preparation was
done in vacuum, the flexoelectric measurements were done in a DMA with an air
atmosphere and with temperatures reaching up to 300 °C, so partial re-oxidation
of the sample is possible.

The re-oxidation kinetics of ceramic BaTiO3 doped with Ho2O3 are described
in ref. 32. The lowest measurement temperature therein is higher than our
maximum measurement temperature: 450 °C versus 300 °C; however, we can extrapolate
the results of ref. 32 to obtain an expected diffusion coefficient of D = 1.6 × 10⁻¹⁰ cm² s⁻¹ at
300 °C. The diffusion length is λ = 2√(Dτ), where τ is time; therefore, to re-oxidize
1 μm (maximum estimated thickness of our barriers), the required time would be
only 15 s, which is well within the timescale of our measurements. According to
these calculations, it is possible that the barrier layer is re-oxidized and that this is
the reason for its insulating properties.
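
The timescale follows from inverting the diffusion-length relation. The sketch below assumes a diffusivity of 1.6 × 10⁻¹⁰ cm² s⁻¹, which is the value that reproduces the quoted time of about 15 s for a 1 μm depth; the function name is illustrative.

```python
def diffusion_time(depth_cm, diffusivity_cm2_per_s):
    """Time (s) for the diffusion length 2*sqrt(D*tau) to reach depth_cm."""
    return depth_cm ** 2 / (4.0 * diffusivity_cm2_per_s)


ONE_MICRON_CM = 1e-4  # 1 um expressed in cm

# Assumed diffusivity for doped ceramic BaTiO3 extrapolated to 300 C.
tau_fast = diffusion_time(ONE_MICRON_CM, 1.6e-10)
print(f"time to reach 1 um: {tau_fast:.1f} s")
```

Because the time scales inversely with the diffusivity, a diffusivity several orders of magnitude lower moves the same 1 μm depth from seconds to hours.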

However, the diffusivity reported in ref. 32 is for doped ceramics, which have
lower activation energies and higher ionic conductivities than do undoped single
crystals. If, instead, we extrapolate the results for undoped BaTiO3 reported in
ref. 33, then we obtain a much lower diffusivity at 300 °C: D = 4 × 10⁻¹⁴ cm² s⁻¹.
Such low diffusivity would require 32,000 s (almost 9 h) for the oxygen to diffuse
to a depth of 1 μm. Therefore, although surface re-oxidation is possible, given the
large range of diffusion coefficients in the literature, we cannot state the depth
of the re-oxidized region with certainty or, consequently, discern whether the
insulating barrier is caused by local re-oxidation or by electronic charge depletion
(that is, Schottky barriers) at the semiconductor-electrode interface.

To shed more light on this question, we performed an Electron Energy Loss
Spectroscopy (EELS) analysis of a cross-section of BTO−δ (Extended Data Fig. 2,
top). The results do not show a monotonic trend in the Ti L absorption peaks that
are typically used to characterize oxygen content in perovskite titanates: the peaks
shift initially to higher energies (consistent with higher oxidation), going from
a depth of 1 μm to 150 nm, but then shift back to lower energies (consistent with
lower oxidation) at a depth of 10 nm. A comparison with the L3 peak in the EELS
spectra of oxygen-deficient SrTiO3−δ (ref. 34) and of BaTiO3−δ (ref. 35) (Extended
Data Fig. 2, bottom) shows that the L3 peak split, which is inversely related to
oxygen content, is consistent with a vacancy concentration of 0.07 < δ < 0.14, or
between 2.3 mol% and 4.6 mol%.

We can quantify vacancies more accurately using gravimetry experiments.
Using a high-precision analytical balance (Sartorius CPA225D), we measured the
weight of a BaTiO3 single crystal before and after vacuum annealing. The initial
weight of the sample (10 mm × 1 mm × 2 mm; length × thickness × width) was
123.12 mg. The weight after vacuum annealing was 122.14 mg. Assuming that the
weight reduction is due to oxygen loss, the decrease in weight (0.98 mg) represents
an oxygen loss of 3.9 mol% (δ = 0.12), which is in the range estimated by the EELS
analysis.
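
The gravimetric figure can be checked with a few lines of arithmetic. This sketch assumes standard atomic masses, that all of the weight loss is oxygen, and that "mol%" means the fraction of oxygen sites left vacant (δ/3); the function name is illustrative.

```python
# Standard atomic masses in g/mol (assumed values, not from the paper).
M_BA, M_TI, M_O = 137.327, 47.867, 15.999


def oxygen_deficiency(mass_before_mg, mass_after_mg):
    """Return (delta, percent of oxygen sites vacant) for BaTiO3-delta."""
    formula_mass = M_BA + M_TI + 3 * M_O                  # g/mol of BaTiO3
    mmol_batio3 = mass_before_mg / formula_mass           # mmol of formula units
    mmol_o_lost = (mass_before_mg - mass_after_mg) / M_O  # mmol of O removed
    delta = mmol_o_lost / mmol_batio3
    return delta, 100.0 * delta / 3.0


delta, percent_vacant = oxygen_deficiency(123.12, 122.14)
print(f"delta = {delta:.2f}, oxygen loss = {percent_vacant:.1f}%")
```

The same δ/3 conversion maps the EELS range 0.07 < δ < 0.14 onto roughly 2.3% to 4.7%, so the two estimates are mutually consistent.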

Reproducibility. Measuring flexoelectricity involves bending brittle crystals that
tend to break and are not easy to replace; repeating measurements on different
crystals is thus not an easy task. Nevertheless, for the reduced BTO−δ sample, we
managed to measure several heating and cooling cycles and obtained similar results
for all of them (Extended Data Fig. 3). For the conductive Nb-doped TiO2 crystal, we
also measured two samples with thicknesses of ~0.5 mm, yielding results that
differed by less than 20%. These two data points are included in Extended Data Fig. 1.
Electronic versus ionic origin of the enhanced flexoelectricity. Oxygen-deficient
BTO is a mixed ionic-electronic conductor. Consequently, oxygen vacancies can
simultaneously act as ionic charge carriers and as electron donors that change the
valence of Ti to Ti³⁺, thus introducing electrons in the conduction band. There is
therefore a question about the origin of the enhanced electromechanical response:
is it due to electronic screening of the polar discontinuity between the barrier layer
and the bulk, or is it due to the strain gradient 'squeezing' the vacancies towards
the convex side of the crystal, thus creating a defect concentration gradient with
associated space-charge polarization? Such a Vegard-like 'flexoionic' mechanism
would be the converse of the well-known electrochemical strain response of soft
ionic conductors such as nafion or muscle tissue, the theory of which is described
in refs 24, 36.

To address this question, we examine two issues: (I) the ratio between ionic and
electronic conductivity in our measured range of temperature and frequency; and
(II) whether a flexoelectric enhancement can be observed in a purely electronic
semiconductor.

First, let us examine the issue of electronic versus ionic conductivity in
oxygen-deficient BTO. Electrons are lighter and more mobile than ions, so they are
expected to dominate conductivity in ceramics at the relatively low temperatures
(by ionic standards) in which we have performed our work. For BTO, even at
temperatures as high as 750 °C, ionic conductivity represents only about 20% of the
total conductivity; at 300 °C (our maximum temperature), the ionic contribution
is expected to be exponentially lower.

We attempted to quantify the actual ionic share of the conductivity in our
samples by extrapolating the ionic conductivity of BTO−δ measured previously at
higher temperatures. Using equations (12) and (13) from ref. 23, and the vacancy
concentration of δ = 0.12 that we measured (using EELS and gravimetry analysis),
we obtain an ionic conductivity of σ_ion = 4 × 10-° Ω⁻¹ cm⁻¹ at 300 °C. This ionic
conductivity would be higher than that of YSZ at the same temperature, a material
that is already considered to be a good oxygen-vacancy conductor (it is the industry
standard). We believe that this value is probably an imprecise overestimate because
it is based on an extrapolation of measurements from the temperature range
950-1,050 °C down to T < 300 °C; an extrapolation range that is much bigger than
the initial fitting range is likely to result in an inaccurate estimate. Even leaving this
important caveat aside, the ionic conductivity would still represent less than 5% of
the total conductivity of BTO−δ at 300 °C (Extended Data Fig. 4).

So, although BTO−δ is indeed a mixed ionic-electronic conductor, in the
temperature range we use, it behaves mostly as an electronic semiconductor with
barrier layers caused by interfacial band-bending.

The ionic conduction model also seems to be inconsistent with the behaviour
as a function of temperature. As temperature increases, ionic mobility increases
exponentially, so a Vegard-like model would predict an exponential increase in
bending-induced ionic drift; that is, increasing temperature should increase the
bending-induced polarization. However, BTO−δ shows the opposite trend (Fig. 3
and Extended Data Fig. 3): above T_C, flexoelectricity decreases with temperature,
showing a Curie-Weiss-like behaviour that is more consistent with a
surface-piezoelectric model.
To determine whether enhanced flexoelectricity can be achieved in a purely
electronic semiconductor, we performed measurements on Nb-doped TiO2 (0.05%
Nb concentration by weight), which is a material for which the conductivity is
purely electronic; each additional Nb⁵⁺ ion forces a Ti⁴⁺ ion to change its valence
to Ti³⁺ to preserve charge neutrality, thus adding an electron to the conduction
band. The results are shown in Extended Data Fig. 5: Nb-doped TiO2 displays
>2,000 times more effective flexoelectricity than the undoped (insulating) TiO2
crystal, suggesting that the effect is of electronic rather than ionic origin. This huge
enhancement persists down to room temperature, at which vacancies, if there are
any (and none are expected), should not be mobile enough to cross the 0.5-mm
thickness of the crystal in less than 100 ms, as is required by our measurement
frequency of 13 Hz.

Therefore, our conclusion is that the enhancement of effective flexoelectricity
for semiconductors does not require ionic transport. 
Inverse flexoelectricity. The converse effect of bending induced by voltage can in
principle also exist, but we believe that it will be much harder to observe in
semiconductors than in insulators for at least two reasons. First, Joule heating caused
by the leakage current across the bulk will cause the semiconductor to expand, and
this thermal expansion can be larger than any other electromechanical response
from the sample. Second, the most likely origin of the low-conductivity interfaces
is charge depletion caused by band-bending (that is, Schottky barriers). In any
rectifying junction, these barriers are wiped out by the application of an external
voltage that is larger than the band misalignment between the electrode and the
semiconductor, so the maximum voltage that one can apply before the barriers
become conducting is limited by the semiconductor bandgap. In practice, this
means that any external voltage that is larger than a couple of volts will essentially
eliminate the barrier and turn it into an ohmic junction, thus removing the
possibility of measuring its piezoelectric response.
Extended Data Figure 1 | Effective flexoelectric coefficients of
semiconducting crystals of Nb-doped TiO2 (0.05% Nb by weight) as a
function of sample thickness. The red line is a linear fit to the data.
Extended Data Figure 2 | EELS analysis. Top, EELS spectra of a
cross-sectional sample of BaTiO3 measured in a transmission electron
microscope. There is no monotonic trend as a function of distance to the
surface, so no indication that the surface (at least to a depth of 1.4 μm) is
any more (or less) oxidized than the bulk. A comparison with the shape of
the EELS spectra of SrTiO3−δ (bottom-left; image reproduced from ref. 34,
Macmillan Publishers Limited) or BaTiO3−δ (bottom-right; reprinted from
ref. 35, with the permission of AIP Publishing) is consistent with δ < 0.14
for our crystals.
Extended Data Figure 3 | Consecutive measurements of the flexoelectric
coefficient for semiconducting BaTiO3−δ.
Extended Data Figure 4 | Conductivity of BaTiO3−δ. Total conductivity
σ = σ_electron + σ_ion measured across the capacitor structure.




[Extended Data Figure 5 panels: TiO2 (left) and Nb-doped TiO2 (right); horizontal axis: Temperature (°C).]


Extended Data Figure 5 | Flexoelectricity of undoped TiO2 and Nb-doped TiO2. The conducting Nb-doped sample (right) displays an effective
flexoelectricity that is >2,000 times larger than the insulating sample (left). Note that the units are nC m⁻¹ and μC m⁻¹ for TiO2 and Nb-TiO2,
respectively.
Molecular transport through capillaries made with
atomic-scale precision

B. Radha, A. Esfandiar, F. C. Wang, A. P. Rooney,


Nanometre-scale pores and capillaries have long been studied
because of their importance in many natural phenomena and their
use in numerous applications. A more recent development is the
ability to fabricate artificial capillaries with nanometre dimensions,
which has enabled new research on molecular transport and led to
the emergence of nanofluidics. But surface roughness in particular
makes it challenging to produce capillaries with precisely controlled
dimensions at this spatial scale. Here we report the fabrication of
narrow and smooth capillaries through van der Waals assembly,
with atomically flat sheets at the top and bottom separated by spacers
made of two-dimensional crystals with a precisely controlled
number of layers. We use graphene and its multilayers as archetypal 
two-dimensional materials to demonstrate this technology, which 
produces structures that can be viewed as if individual atomic planes 
had been removed from a bulk crystal to leave behind flat voids of a 
height chosen with atomic-scale precision. Water transport through 
the channels, ranging in height from one to several dozen atomic 
planes, is characterized by unexpectedly fast flow (up to 1 metre per 
second) that we attribute to high capillary pressures (about 1,000 bar) 
and large slip lengths. For channels that accommodate only a few 
layers of water, the flow exhibits a marked enhancement that we 
associate with an increased structural order in nanoconfined water. 
Our work opens up an avenue to making capillaries and cavities with 
sizes tunable to ångström precision, and with permeation properties
further controlled through a wide choice of atomically flat materials 
available for channel walls. 

There are two principal routes for making pores and capillaries with
nanometre dimensions. The top-down approach uses micro- and
nano-fabrication techniques and has realized channels down to 2 nm
in average height, but is fundamentally limited by surface roughness
that is hard to reduce below a few nanometres using conventional
materials and techniques. The alternative bottom-up approach uses
chemical synthesis with many advantages for scalable manufacturing,
but which offers limited flexibility, especially for making capillaries
with dimensions larger than several ångströms. Notable exceptions
are nanotubes of carbon and other materials, which offered
opportunities for studying mass transport through channels with nanometre
diameters and atomically smooth walls and promised new kinds
of membrane and nanofluidic systems. But it has proved extremely
difficult to integrate nanotubes into macroscopic devices, which
perhaps explains the continuing controversy about fast water transport
through carbon nanotubes (CNTs): conflicting findings from the very
few experimental groups who succeeded in studying their permeation
properties have been discussed intensively in theoretical
literature, but with little further input from experiment. Graphene
has also attracted considerable attention as a core material for making
ultra-short nanopores, and gas, liquid, ion and DNA transport
through such pores has been reported. But the fundamental
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restrictions inherent to top-down and bottom-up techniques also limit 
the ability to control the diameters of graphene nanopores precisely. 
We overcome such problems by exploiting both the atomic flatness of 
graphene (which allows for relatively long channels with atomically 
smooth walls, somewhat similar to CNTs) and its atomic thinness 
(which, through stacking, provides atomic-scale control of the channel’s 
principal dimension, height). Our approach also preserves much of the 
flexibility offered by microfabrication techniques. 

Figure 1a explains the basic idea behind our nanocapillary devices: 
they consist of atomically flat top and bottom graphite crystals that 
are separated by an array of spacers made from few-layer graphene. 
Such structures are fabricated by van der Waals (vdW) assembly using 
dry transfer techniques and a free-standing Si nitride membrane
with a rectangular hole as mechanical support for the assembly.
Figure 1b-d shows micrographs of some of our devices. For details of
their fabrication, we refer to Methods section ‘Making nanocapillary
devices’ and Extended Data Figs 1 and 2. We denote our devices by the
number N of graphene layers used as spacers. The height h of the cavity
available for molecular transport can then be estimated as Na, where
a ≈ 3.4 Å is the interlayer distance in graphite, that is, the effective
thickness of one graphene layer. All the capillaries reported here had the
same channel width w = 130 nm, and 200 of them were incorporated
within each device to increase molecular flow (Fig. 1). Their length L
varied from <2 µm to ~10 µm. Despite the large aspect ratios w/h, we
found no sagging of the graphite walls, which would cause capillary
closure (Fig. 1d and Extended Data Fig. 3).
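
As a quick illustration of the estimate h ≈ Na, the following minimal sketch converts a spacer layer count N into a cavity height and the aspect ratio w/h. The function names are hypothetical and chosen only for this example; a = 3.4 Å and w = 130 nm are the values quoted in the text.

```python
# Illustrative helpers; the names are hypothetical and not taken from the paper.
A_GRAPHITE_M = 3.4e-10  # interlayer distance a in graphite (~3.4 angstrom)
WIDTH_M = 130e-9        # channel width w quoted in the text


def cavity_height(n_layers: int, a: float = A_GRAPHITE_M) -> float:
    """Estimate the cavity height h = N * a in metres."""
    if n_layers < 0:
        raise ValueError("n_layers must be non-negative")
    return n_layers * a


def aspect_ratio(n_layers: int, width: float = WIDTH_M) -> float:
    """Return w / h for a capillary built with N >= 1 spacer layers."""
    if n_layers < 1:
        raise ValueError("aspect ratio is defined only for N >= 1")
    return width / cavity_height(n_layers)


if __name__ == "__main__":
    for n in (2, 3, 5, 45):
        h = cavity_height(n)
        print(f"N={n:2d}  h={h * 1e9:.2f} nm  w/h={aspect_ratio(n):.0f}")
```

Under these assumptions a bilayer spacer gives a height of about 7 Å and N = 45 gives about 15 nm, which is how the device labels map onto the heights used in the figures.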

Under ambient conditions, all surfaces are covered with various
adsorbates including water and hydrocarbons, and it is not unreasonable
to expect that nanocapillaries could be blocked by contamination
introduced during fabrication or adsorbed from the air. Accordingly,
we first checked whether our devices were open for gas and ion
transport. Extended Data Fig. 4 shows that this was the case and that
He permeated through the capillaries. We carried out such He tests for
practically all the devices and found them normally open, except for
monolayer capillaries (N = 1), which never exhibited any detectable
permeation. Devices with larger N gradually deteriorated and, after
several days of measurements, often became blocked. We attribute this
to a build-up of hydrocarbon contamination that creeps along surfaces
and is present even under oil-free vacuum conditions in our He tests.
On the other hand, if immersed in water, the capillaries showed much
greater resilience. All the tested devices were found open (except for
N = 1, again) and exhibited ionic conductance scalable with their
dimensions (see Methods section ‘Ionic conductance’ and Extended
Data Fig. 5). Capillaries kept in water did not get blocked for months
and could be repeatedly measured. The fact that such artificial channels
with a height down to the ångström scale allow studies of molecular
transport under normal (not ultra-high) vacuum conditions is perhaps
the most surprising finding of this work.
Figure 1 | Graphene capillary devices. a, General schematic of devices.
The arrow indicates the flow direction used in all the experiments.
b, Scanning electron microscopy (SEM) image of a trilayer device (top
view). The spacers that are clearly seen in the area not covered by the top
graphite can also be discerned underneath, running all the way to the
hole etched in the bottom graphite. Three of the spacers are indicated
by dotted lines and the edge of the hole by the dashed line. c, SEM
micrograph of a cross-section of another device showing an array of
capillaries with cavity height h = 15 nm. d, Cross-sectional bright field
image of a bilayer capillary (h ≈ 7 Å) in a scanning transmission electron
microscope (STEM). e, High-angle annular dark field (HAADF) image
of the edge of the channel. The lamellae for cross-sectional imaging were
made by focused ion beam milling (see Methods section ‘Visualization and
characterization of graphene capillaries’).

Given the intense interest in nanoconfined water and the high
stability of our devices in water, we explored their properties with
respect to water permeation using precision gravimetry. As sketched
in Fig. 2a and described in full in Methods, we measured weight loss
from a miniature container that was filled with water and sealed with
a Si nitride chip incorporating a nanocapillary device (Fig. 2a inset and
Extended Data Fig. 6). An example of such measurements is shown in
Fig. 2a. The slope of the measured curve yields the water evaporation
rate, Q. Because the total cross-section of our devices is typically
<0.1 µm², measurements with microgram precision over several days
were required to achieve accurate determinations of Q. Figure 2b shows
Q observed for six devices with the same height (N = 3) but different L.
Within our accuracy, Q was found to vary proportionally with
1/L = ⟨1/L⟩, where L is the effective average length with respect to a
viscous flow, and ⟨...⟩ denotes averaging over contributions from
channels with different L (see equation (1)). This dependence on L
unambiguously indicates that the observed evaporation rate was limited
by water flow through capillaries, in agreement with additional
experimental observations described in Methods section ‘Gravimetric
measurements’. The reproducibility of our gravimetry results can be
judged from the scatter in the inset of Fig. 2b, where Q values for the
N = 3 devices are normalized by their L. All the trilayer capillaries show
practically the same Q ≈ 10⁻⁸ g s⁻¹ normalized for 1 µm length, which
translates into a flow velocity of ~0.1 m s⁻¹ for the shortest device in
Fig. 2b (L ≈ 4 µm). As a control, we fabricated devices following exactly
the same fabrication procedures but without graphene spacers (N = 0),
in which case no weight loss could be detected. In addition, we tested
our gravimetric set-up using micrometre apertures made in Si nitride
membranes and found evaporation rates that agree well with those
expected from theory (see Methods section ‘Gravimetric
measurements’).
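
The step from a measured evaporation rate to a flow velocity can be sketched as follows. This is a minimal illustration rather than the authors' analysis: it assumes bulk water density, 200 identical channels of width 130 nm and height Na, and treats the length-normalized Q as scaling with 1/L. The function and parameter names are hypothetical.

```python
RHO_WATER = 1000.0  # kg m^-3, bulk density assumed for this sketch


def mean_flow_velocity(q_norm_g_per_s_um: float, length_um: float, n_layers: int,
                       n_channels: int = 200, width_m: float = 130e-9,
                       a_m: float = 3.4e-10, rho: float = RHO_WATER) -> float:
    """Mean liquid velocity (m/s) from a length-normalized mass flow.

    q_norm_g_per_s_um is Q normalized to 1 um channel length (g s^-1 um),
    so the flow through a device of length L is q_norm / L.
    """
    q_kg_per_s = q_norm_g_per_s_um / length_um * 1e-3  # g/s -> kg/s
    total_area = n_channels * width_m * n_layers * a_m  # m^2, all channels
    return q_kg_per_s / (rho * total_area)


if __name__ == "__main__":
    # Trilayer device, Q ~ 1e-8 g/s per um of length, shortest device L ~ 4 um.
    v = mean_flow_velocity(1e-8, 4.0, 3)
    print(f"mean flow velocity ~ {v:.2f} m/s")
```

With the numbers quoted above, the estimate comes out close to 0.1 m s⁻¹, consistent with the velocity stated in the text.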

Having proved the accuracy and reproducibility of our measurements
using trilayer devices, we investigated how the capillary flow
depended on N using more than 30 different devices. Figure 3 shows
that, as h decreases from ~10 nm (maximum height in our gravimetry
experiments), Q also decreases, as generally expected. However,
for h < 2 nm, Q unexpectedly shoots up by more than an order of
magnitude with respect to the trend exhibited by large-N capillaries,
and a strong peak appears at N = 4-5 (Fig. 3b). Devices with monolayer
spacers exhibited no detectable weight loss, similar to the case of N = 0
and in agreement with our He and ion-conductance tests.

Figure 2 | Water permeation through graphene nanocapillaries.
a, Weight loss due to water evaporation through one of our trilayer
devices. Bottom inset, optical image (natural colour) of this particular
device that has 200 parallel channels with L ranging from 3.6 µm to 10.1 µm.
Top inset, a basic schematic of our gravimetric measurements. b, Water
evaporation rate (Q) measured for six trilayer devices with different
effective L (symbols; see text for definition of L). The error bars indicate
the range of L within each device. Inset, same data but normalized by L.
The heights of the bars correspond to the measured Q.

Figure 3 | Water flow through channels of different height. a, Examples
of gravimetric measurements for various N (the number of graphene layers
used as spacers). They were carried out at 21 °C in near zero humidity,
and the curves are normalized for the effective length of the devices, L.
b, Dependence of Q on capillary height (the data are normalized by L and
given per one channel). The blue error bar shows the s.d. for the data from
Fig. 2b; the black error bar indicates the data scatter for two devices
measured for this particular height. Such ±25% reproducibility is also
typical for other N, for which at least two devices were measured. Only the
data for N = 12 and 20 are based on single-device measurements. The red
error bar indicates our detection limit (no flow could be detected for
N = 1). Inset, flow rates found in our MD simulations.

The entire evaporation process involves several steps, including
transport of water vapour to capillary mouths inside the container,
viscous flow through the graphene capillaries and subsequent diffusion
and evaporation of transported water into air. To find out which steps
affect the observed permeation, we carried out additional experiments.
When the container was weighed upside down so that the liquid was
in direct contact with the entries of the capillaries, exactly the same Q
was recorded as in the upright position (see Methods section
‘Gravimetric measurements’). This is not surprising because, at 100%
relative humidity (RH) inside the container and the contact angle of
water on graphite φ ≈ 55°-85°, the channels should be filled with the
liquid owing to capillary condensation. Atomic force microscopy
and Raman spectroscopy further confirmed that the water inside the
capillaries was in a liquid state (see Methods). We also note that
the water meniscus cannot reside inside the nanocapillaries, given that
the observed Q values require a water surface area of ~1 µm² (as follows
from the Hertz-Knudsen equation), that is, one to two orders of
magnitude larger than the total cross-sectional area of the capillaries
in our devices. Therefore, evaporation of the transported liquid must
take place outside the mouths of the capillaries, which suggests an
evaporating extended meniscus of the kind that has been extensively studied in
the literature for the case of macroscopic capillaries. For our nanoscale
openings, the extended meniscus is likely to involve an atomically thin
layer of adsorbed water that extends over micrometre distances, being
driven by high spreading pressures (Extended Data Fig. 7a). To assess
the role of this water film in our case, we measured Q for different RH
outside the container. Surprisingly, no difference was found with
increasing external RH up to values close to the onset of capillary
condensation (see Methods section ‘Gravimetric measurements’). This
shows that Q is not limited by diffusion and evaporation processes
outside the container, and that the limiting process in our system is
instead liquid flow through the graphene channels, in agreement with
the finding that Q depends only on the capillary parameters L and N.

For long and wide rectangular channels with w/h ≫ 1, liquid flow
driven by pressure P is described by

$$ Q = \rho \frac{h^{3}}{12\eta}\left(1 + \frac{6\delta}{h}\right) P \frac{w}{L} \qquad (1) $$

where ρ is the water density, η its viscosity and δ the slip length. All
these characteristics of nanoconfined water may depend on h. To find
out whether equation (1) can explain the observed water transport
behaviour, we performed molecular dynamics (MD) simulations using
typical parameters for water-water and water-carbon interactions
(Methods). Our analysis shows that δ is large (~60 nm) but does not
vary much with h (Extended Data Fig. 7), in agreement with previous
MD results for flat graphene surfaces. Also, changes in ρ are
found to be relatively minor, reaching 4% for our smallest channels
(Extended Data Fig. 8). The viscosity η increases by a factor of 2 for
N < 5, which reflects the fact that water becomes more structured under
nanoconfinement. Using these parameters in equation (1), we find
that Q detected for our smallest capillaries requires P of the order of
1,000 bar. This is consistent with supporting transport measurements
using containers pressurized at ~1.5 bar (close to the maximum
pressure that our membranes could withstand), which revealed no
difference in Q (see Methods section ‘Gravimetric measurements’).
It is difficult to perform MD simulations of the capillary pressure
exerted by an evaporating extended meniscus because the low
density of vapour necessitates a prohibitively large simulation volume.
Therefore, we introduce the following simplification. The
curvature of the extended meniscus is determined by two spatial
scales, its height h and length outside the capillary mouth (Extended
Data Fig. 7a). The length is expected to depend on RH but this
was not the case experimentally, which allows us to approximate
the extended meniscus using only h. With reference to Extended
Data Figs 7a and 8a, both extended and internal menisci should
have approximately the same height and involve the same interaction
of water with graphite. Therefore, in both cases, P can be
approximated as P₀ + Π = 2σ cos(φ)/h + Π, where the first term
describes the pressure due to a curved meniscus (with σ = 72 mN m⁻¹
the surface tension of water). Even for our largest channels, P₀
exceeds 10 bar. The second term Π refers to the so-called disjoining
pressure that describes the water-surface interaction, which can
dominate at the nanoscale but rapidly decreases with increasing h.
Our MD simulations (see Methods section ‘Capillary pressure’) show
that, for large N > 10, P roughly follows the classical dependence P₀,
with φ ≈ 80°; but the disjoining pressure becomes dominant at smaller
N, reaching above 1,000 bar (Extended Data Fig. 8b). Combining
the simulated P with the other flow characteristics found in our MD
analysis, equation (1) yields the Q(N) dependence shown in the inset of
Fig. 3b. It qualitatively reproduces our experimental findings, including
the peak at small N and even its absolute value. The physics behind the
non-monotonic dependence Q(N) can be understood as follows. At
large N, the classical contribution P₀ ∝ 1/h dominates and equation (1)
yields the linear dependence Q ∝ h, in agreement with the trend
observed in Fig. 3b for h > 3 nm. Evaluating equation (1) numerically
in the classical limit (P= Po), we find Q~10-"gs~!ym for h=10nm, 
in agreement with the values in Fig. 3b. The marked increase in Q for
small N is due to the rapidly rising disjoining pressure, whereas the final
fall in Q for smallest N occurs due to a combined effect of decreasing h
and increasing η, which both reduce Q, overtaking the rise in Π at
small h. Note that, if it were not for the large enhancement factor 6δ/h
due to the low friction of water against graphene walls, the simulated
flow would be well below our detection limit.
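
The classical part of this argument can be sketched numerically. The snippet below is a minimal illustration, not the authors' MD-based calculation: it evaluates the meniscus term P₀ = 2σ cos(φ)/h and the slit-flow expression of equation (1) with bulk values of ρ and η (an assumption), δ = 60 nm as quoted above, and no disjoining pressure Π, so it cannot reproduce the peak at small N. All function names are hypothetical.

```python
import math


def capillary_pressure(h: float, contact_angle_deg: float, sigma: float = 0.072) -> float:
    """Classical meniscus term P0 = 2*sigma*cos(phi)/h, in Pa (h in metres)."""
    return 2.0 * sigma * math.cos(math.radians(contact_angle_deg)) / h


def slip_enhancement(h: float, slip_length: float) -> float:
    """Slip-flow factor 1 + 6*delta/h appearing in equation (1)."""
    return 1.0 + 6.0 * slip_length / h


def mass_flow(h: float, pressure: float, length: float, width: float = 130e-9,
              rho: float = 1000.0, eta: float = 1.0e-3,
              slip_length: float = 60e-9) -> float:
    """Mass flow (kg/s) through one slit channel, following equation (1).

    rho and eta default to bulk-water values, which is an assumption of
    this sketch; the text notes that both may depend on h.
    """
    return (rho * h ** 3 / (12.0 * eta)
            * slip_enhancement(h, slip_length) * pressure * width / length)


if __name__ == "__main__":
    for h_nm in (3.0, 5.0, 10.0):
        h = h_nm * 1e-9
        p0 = capillary_pressure(h, 80.0)
        q = mass_flow(h, p0, length=1e-6)
        print(f"h={h_nm:4.1f} nm  P0={p0 / 1e5:6.1f} bar  Q={q:.2e} kg/s")
```

Because P₀ ∝ 1/h and the slip term dominates for h much smaller than 6δ, the computed flow scales almost linearly with h in this regime, which is the large-N trend described above.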

The agreement between our model and the experiment is striking,
especially if we consider the approximation used for calculating P and
the unresolved experiment-theory dispute concerning water
permeation through CNTs. Further work is required to fully understand
the mechanisms involved, and, in particular, to model extended menisci
with nanoscale dimensions. Finally, the observed closure of monolayer
capillaries (N = 1) does not seem to be an accidental effect. Our MD analysis
reveals that such narrow cavities are intrinsically unstable and collapse due
to vdW attraction between opposite graphite walls (Extended Data Fig. 9).

Our fabrication approach allows capillary devices to be prepared 
in which the channel height can be controlled with true atomic 
precision by choosing spacers of different two-dimensional crystals 
(such as graphene, boron nitride, molybdenum disulphide) and their 
combinations. One can also alter the chemical and physical character- 
istics of these capillaries (for example, change their hydrophilicity) by 
using different atomically flat crystals for channel walls. Furthermore, 
the availability of highly insulating materials such as boron nitride and 
mica allows the design of nanofluidic systems in which ionic or mass 
transport can be controlled by gate voltage. Our current devices transfer 
minute amounts of liquid, typical for nanofluidics, but it is feasible 
to increase the flow by many orders of magnitude using dense arrays 
of short (submicrometre) capillaries covering millimetre-sized areas, 
which could be of interest for nanofiltration, for example. 


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.
METHODS

Making nanocapillary devices. Our fabrication procedures are explained in
Extended Data Fig. 1. First, we prepare a free-standing Si nitride membrane of
approximately 100 × 100 µm² in size using commercially available Si wafers with
500-nm-thick Si nitride. A rectangular hole (3 × 20 µm²) is made in the membrane
using standard photolithography and reactive ion etching (step 1). Then a
relatively thick (>10 nm) graphite crystal is deposited to seal the opening (step 2)
using the dry transfer method described in the Supplementary Information of
ref. 33. On a separate Si wafer (with 300 nm of SiO2) we prepare multilayer
graphene of a chosen thickness using micromechanical cleavage to serve as a
spacer. The graphene crystal is patterned by electron beam lithography and
oxygen plasma etching to create an array of parallel stripes of ~130 nm in width
and separated by the same distance (Fig. 1a). These dimensions are chosen to
obtain sufficiently narrow channels (to prevent them from collapsing; see below)
and, at the same time, to ensure full reproducibility using our lithography facilities.
The graphene stripes are then transferred onto the bottom graphite so that they
are aligned perpendicular to the long side of the rectangular opening (step 3).
Oxygen plasma etching is employed to drill through the graphite-graphene stack
using the hole in Si nitride as a mask (step 4). Finally, another graphite crystal
(approximately 100 nm in thickness) is ‘dry-transferred’ to serve as the capping
layer. This completes a set of graphene capillaries, such that their entries and exits
are accessible from the opposite sides of the Si wafer (step 5). After each transfer,
the assembly is annealed at 400 °C for 3 h to remove possible contamination.
Visualization and characterization of graphene capillaries. In addition to
Fig. 1, Extended Data Fig. 2 provides further examples of imaging of our graphene
capillaries including their optical, atomic force microscopy (AFM), SEM and STEM
micrographs. We used SEM and optical images such as in Fig. 1b and Extended
Data Fig. 2a to measure lengths of our devices and calculate their average length
with respect to frictional flow, L = ⟨1/L⟩⁻¹. For most of our devices, L varied by
less than 30% (for example, Extended Data Fig. 2a) and, accordingly, we found no
qualitative difference if using L or ⟨L⟩ in our analyses.
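
The effective length defined by 1/L = ⟨1/L⟩ is the harmonic mean of the individual channel lengths. A minimal sketch is given below; the function name is hypothetical, and the linear spread of lengths in the usage example is an assumption made only for illustration (the text gives just the range for one device).

```python
def effective_length(lengths):
    """Return the effective length defined by 1/L_eff = <1/L> (harmonic mean)."""
    lengths = list(lengths)
    if not lengths:
        raise ValueError("at least one channel length is required")
    if any(length <= 0 for length in lengths):
        raise ValueError("channel lengths must be positive")
    return len(lengths) / sum(1.0 / length for length in lengths)


if __name__ == "__main__":
    # Hypothetical device: 200 channels with lengths spread linearly
    # between 3.6 um and 10.1 um.
    n = 200
    lengths_um = [3.6 + (10.1 - 3.6) * i / (n - 1) for i in range(n)]
    l_eff = effective_length(lengths_um)
    l_mean = sum(lengths_um) / n
    print(f"effective L = {l_eff:.2f} um, arithmetic mean = {l_mean:.2f} um")
```

The harmonic mean weights short channels more heavily, which matches the fact that flow through each channel scales as 1/L; for a narrow spread of lengths it stays close to the arithmetic mean.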

To obtain the cross-sectional SEM images shown in the figure, we used a dual-beam
system (Zeiss Crossbeam 540), which combines electron microscopy with
focused ion beam (FIB) capabilities. The region of interest was located using SEM,
and a protective Pt layer (~0.5 µm thick) was deposited on top. Then a trench
was milled using a 30 kV Ga⁺ beam at 0.1 nA current, which exposed the device’s
cross-section. Two additional polishing steps at 10 and 1 pA were subsequently
carried out using the same 30 kV Ga⁺ beam. During the final step, the raster had
a width of 200 nm, and it took approximately 30 min to complete the polishing.

Samples for STEM were obtained by implementing the in situ lift-out
procedures in a FIB system (Helios Nanolab DualBeam 660). A cross-sectional
lamella (that is, a thin foil cut out perpendicular to the capillary axes) was
prepared by FIB milling and lifted from the substrate using a micromanipulator,
aided by ion beam deposition of platinum. After transfer to a specialist OmniProbe
grid, the foil was thinned down to <100 nm and then polished to electron transparency
using 5 kV and subsequently 2 kV ion milling. High-resolution STEM images
were acquired in an aberration-corrected microscope (FEI Titan G2 80-200 kV)
using a probe convergence angle of 21 mrad, a HAADF inner angle of 48 mrad
and a probe current of ~80 pA. To ensure that the electron probe was parallel to
graphite planes, the cross-sectional sample was aligned to the relevant Kikuchi
bands of the Si substrate and graphite.

To prevent closure of our nanocapillaries through sagging of their walls, it is
essential to choose appropriate values for the channel width, w, and top graphite’s
thickness, H (bottom graphite is supported by the substrate, which stops it from
sagging). To illustrate the crucial role of H, Extended Data Fig. 3 shows AFM
images of trilayer channels covered with top graphite of a varying thickness. One
can see that thin graphite (H = 12 nm) sags, at least partially, into the channels
whereas the thicker layer (52 nm) remains atomically flat, which suggests that the
channels underneath are likely to remain open. For our standard channels with
w = 130 nm, we find that it requires H > 50 nm to avoid their collapse. On the other
hand, for w > 500 nm, the top graphite crystal in our experiments always sagged
into the channels (even for H > 200 nm; Extended Data Fig. 3b).

To confirm the presence of liquid water inside our nanocapillaries, we carried
out Raman spectroscopy on them. A working device with N = 30 was fabricated
following the procedures described above but, instead of graphite, hexagonal
boron nitride was used as the top layer. This provided optical access for a laser
beam, while hexagonal boron nitride’s mechanical and tribological properties
are similar to those of graphite. We chose the large N to increase the amount of
examined water as its Raman signal is relatively weak. If liquid water was placed
inside the container, a clear signature of water (peak at ~3,250 cm⁻¹ using a laser
wavelength of 785 nm) appeared in the Raman spectra taken from the area with
graphene capillaries. No signal could be detected at low RH or outside the area.
It is also instructive to mention that capillaries with small sagging (<0.5 nm)
reacted to high RH in such a way that the sagging disappeared and the top graphite
layer became flat in the AFM images. For example, for capillaries with N = 5 this
straightening of graphene walls happened at ~70% RH, indicating the onset of
capillary condensation. This allows an estimate for the contact angle φ ≈ 55°,
in agreement with φ observed for water on clean graphite surfaces. No changes
with increasing RH were observed for sufficiently thick top layers that exhibited
no initial sagging and were used in the studied devices.
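
One way to arrive at a contact-angle estimate of this kind is the Kelvin equation for a slit, ln(RH) = -2σ V_m cos(φ)/(h R T). The text does not state which relation was used, so the sketch below is an assumption; it takes bulk values for the surface tension σ and the molar volume V_m of water and h = Na with a = 3.4 Å. The function name is hypothetical.

```python
import math

R_GAS = 8.314  # J mol^-1 K^-1


def contact_angle_from_condensation(h: float, rh_onset: float,
                                    temperature: float = 294.15,
                                    sigma: float = 0.072,
                                    molar_volume: float = 1.8e-5) -> float:
    """Contact angle (degrees) from the onset RH of capillary condensation.

    Uses the Kelvin equation for a slit of height h (metres):
        ln(RH) = -2 * sigma * V_m * cos(phi) / (h * R * T)
    Bulk sigma (N/m) and molar volume (m^3/mol) of water are assumed.
    """
    if not 0.0 < rh_onset < 1.0:
        raise ValueError("rh_onset must lie strictly between 0 and 1")
    cos_phi = (-math.log(rh_onset) * h * R_GAS * temperature
               / (2.0 * sigma * molar_volume))
    if cos_phi > 1.0:
        raise ValueError("no real contact angle satisfies the Kelvin equation")
    return math.degrees(math.acos(cos_phi))


if __name__ == "__main__":
    h = 5 * 3.4e-10  # N = 5 capillary
    phi = contact_angle_from_condensation(h, 0.70)
    print(f"estimated contact angle ~ {phi:.0f} degrees")
```

For N = 5 and an onset at about 70% RH, this relation gives an angle close to the value quoted above.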

Helium permeation. To ensure that the fabricated capillaries are not blocked
by sagging or contamination, we checked gas permeation through them using
a helium-leak detector (INFICON UL200). A principal schematic of our
experimental set-up is shown in Extended Data Fig. 4a. In short, a Si wafer
with a capillary device is clamped between O-rings and separates two oil-free
vacuum chambers. One of them is equipped with pressure gauges and a pump to
allow control of the applied helium pressure P₁ at the capillary entry. The other
chamber is connected to the leak detector. We have found our graphene-Si nitride
membranes sufficiently robust to withstand P₁ up to 2 bar. Examples of our tests
are shown in Extended Data Fig. 4b. Except for devices with N = 0 and 1, all other
nanocapillaries allowed He permeation.

Although discussion of gas transport through graphene nanocapillaries is
beyond the scope of the present report, it is instructive to compare the observed
He rates Q with those expected theoretically. For a channel with h much smaller
than the mean free path l ≈ 140 nm for He atoms at atmospheric pressure, their
mass transport is described by the Knudsen formula
where M_He is the atomic mass of He. For narrow-slit channels, the transmission
coefficient α can be approximated by α = 5(h/L). In the case of h = 15 nm and
P,=100 mbar, equation (2) yields Q=7 x 10" gs~!jum, in good agreement with 
our measurements shown in Extended Data Fig. 4b for N = 45. On the other hand,
smaller capillaries (N ≤ 5) are found to exhibit leak rates that are nearly two orders
of magnitude higher than the rates expected from equation (2). Moreover, their Q
were even greater than those found for channels 10 times higher (Extended Data
Fig. 4b), contrary to general expectations. A similar enhancement of He flow was
previously reported for sub-2-nm CNTs and attributed to the atomic smoothness
of graphene walls.
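
Since the displayed form of equation (2) is not legible in this text, the sketch below assumes the standard free-molecular (effusion) expression: the mass flux P·sqrt(M/(2π k_B T)) incident on the channel entrance, multiplied by the cross-section w·h and by a transmission coefficient α. The prefactor in α = c·(h/L) is taken as printed above (c = 5) and exposed as a parameter; both the formula and the names are assumptions for illustration, not a reproduction of the authors' equation.

```python
import math

K_B = 1.380649e-23    # J/K
AMU = 1.66053907e-27  # kg


def knudsen_mass_flow(h: float, length: float, pressure: float, mass_amu: float,
                      width: float = 130e-9, temperature: float = 294.15,
                      alpha_prefactor: float = 5.0) -> float:
    """Free-molecular mass flow (kg/s) through one slit channel.

    Assumed form: Q = alpha * P * sqrt(M / (2*pi*k_B*T)) * w * h,
    with alpha = alpha_prefactor * h / L. SI units throughout.
    """
    alpha = alpha_prefactor * h / length
    m = mass_amu * AMU
    incident_flux = pressure * math.sqrt(m / (2.0 * math.pi * K_B * temperature))
    return alpha * incident_flux * width * h


if __name__ == "__main__":
    # Helium, h = 15 nm, 100 mbar at the entry, 1 um channel length.
    q_he = knudsen_mass_flow(15e-9, 1e-6, 1.0e4, 4.0026)
    print(f"He flow per channel ~ {q_he * 1e3:.1e} g/s")
```

In this form the flow scales as h² for fixed L, so the unexpectedly high He rates reported for the thinnest channels correspond to a departure from that scaling.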

For the case of water vapour driven by the difference in RH (differential
pressure of 23 mbar), equation (2) yields evaporation rates of ~5 x 10-"°gs-!jum 
and 400 » 10-"gs~ im for channels with N= and 45, respectively, which is 3-5 
orders of magnitude smaller than the experimental values in Fig. 3b. Even if the
vapour permeation is enhanced by two orders of magnitude, as observed for the He
transport through capillaries with N= 5, this still leaves three orders of magnitude 
unaccounted for. This disagreement provides yet another indication that water 
permeates through our graphene channels as a liquid. 
Ionic conductance. We also tested a number of capillary devices using the
electrochemical set-up shown in Extended Data Fig. 5a. KCl solutions of different
concentrations C were introduced into two reservoirs separated by a Si wafer
incorporating a graphene device under investigation. Possible air bubbles were
removed by extensive flushing from both sides of the Si wafer. Current-voltage
(I-V) characteristics were recorded using a Keithley 2636A SourceMeter and
Ag/AgCl electrodes. Extended Data Fig. 5b, c shows examples of our measurements
for two devices with N = 2 and 17. The I-V curves are linear at low biases and
exhibit little hysteresis. At high C, the observed ionic currents for a given voltage
differ by a factor of ~8, in good agreement with the ratio between
the channel heights. Devices with N = 0 and 1 exhibited no detectable ionic
conductance.

Extended Data Fig. 5d shows that the ionic conductance, G, increases linearly 
with C for ionic concentrations higher than 10-7M, and its absolute value agrees 
well with the values expected from the known bulk conductivity of KCl solutions.
In the low concentration regime (<10~*M), G saturates to a constant value, the 
same for both devices. Such saturation is typical for nanocapillaries and attributed 
to the surface charge effect. In our case, the saturation value is very small and,
taking into account electro-osmotic and finite-é contributions”, we find a surface 
charge density of ~3 x 10"° em”, orders of magnitude lower than the values 
observed for other nanocapillaries including CNTs. This serves as another
indication that graphene walls of our channels are impurity-free, in agreement
with low charge densities usually found in graphene-based vdW heterostructures.
Gravimetric measurements. The set-up used in our studies of water permeation
is shown in Extended Data Fig. 6a,b. The assembled capillary device was mounted
on top of a container partially filled with deionized water. The container was then
placed on a microbalance (Mettler Toledo XPE26) and weighed in an enclosure
with a constant temperature (typically, 21 ± 0.1 °C) and at near 0% humidity that
was maintained using molecular sieves. The weight of the container was recorded
at regular intervals (typically, 1 min) using a computer.

To verify the accurate operation of the gravimetric set-up, we prepared reference
devices with round apertures of different diameters, D, etched in Si nitride
membranes. Using the same sample mounting and measurement procedures as
for our graphene devices, we measured water evaporation through the apertures
(Extended Data Fig. 6). The Knudsen numbers for our apertures are small and
the evaporation can be described by diffusion of water molecules through air. The
molecular flow F is given by

$$ F = \frac{1}{3} \langle v \rangle \, l \, \frac{dn}{dx} $$

where ⟨v⟩ is the average velocity of molecules in air, l ≈ 60 nm is the mean free path,
and dn/dx the concentration gradient. To leave the container, water molecules
have to diffuse through air over a distance of about D, which allows an estimate
dn/dx ≈ Δn/D, where Δn = ΔP/(k_B T) is the difference in water concentrations at
large distances from the aperture and ΔP the difference in their partial pressures.
The diffusion problem can be solved exactly for the case of infinitely thin orifices,
which is a reasonable approximation for our 500-nm-thick Si nitride membranes
and yields


where M_H2O is the molecular weight of water. This equation yields Q ∝ D, in
agreement with the observed behaviour in Extended Data Fig. 6d. The counterintuitive
linear dependence arises because the available area for diffusion increases
proportionally to D² whereas the concentration gradient, set by a diffusion length of
about D, decreases as 1/D. Using ΔP = 23
mbar, the above equation yields ~1,7 x 10- gs! x D (inum), which is within 
15% from the best fit in Extended Data Fig. 6d. Importantly, the measurements for
our aperture devices cover approximately the same range of Q as that found for
graphene capillaries (Fig. 3). The excellent agreement between the experiment and
theory confirms the reliability of our gravimetry set-up.
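
The linear scaling with D can be illustrated with a rough order-of-magnitude estimate built only from the ingredients named above: a diffusive flux (1/3)⟨v⟩ l Δn/D passing through an aperture of area πD²/4. This is not the exact thin-orifice solution referred to in the text, so the numerical prefactor is only indicative. The mean thermal speed formula and all names are assumptions of this sketch.

```python
import math

K_B = 1.380649e-23    # J/K
AMU = 1.66053907e-27  # kg


def aperture_evaporation_estimate(diameter: float, delta_p: float = 2300.0,
                                  temperature: float = 294.15,
                                  mean_free_path: float = 60e-9) -> float:
    """Rough mass flow (kg/s) of water vapour diffusing through a round aperture.

    Flux F = (1/3) * <v> * l * (delta_n / D), multiplied by the aperture
    area pi*D^2/4 and by the mass of one water molecule. SI units throughout.
    """
    m_h2o = 18.015 * AMU
    v_mean = math.sqrt(8.0 * K_B * temperature / (math.pi * m_h2o))  # mean thermal speed
    delta_n = delta_p / (K_B * temperature)                          # molecules per m^3
    flux = v_mean * mean_free_path / 3.0 * delta_n / diameter        # molecules m^-2 s^-1
    area = math.pi * diameter ** 2 / 4.0
    return m_h2o * flux * area


if __name__ == "__main__":
    for d_um in (1.0, 2.0, 5.0):
        q = aperture_evaporation_estimate(d_um * 1e-6)
        print(f"D={d_um:.0f} um  Q~{q * 1e3:.2e} g/s")
```

Doubling D doubles the estimated rate, because the area grows as D² while the gradient falls as 1/D.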

To narrow down the range of possible explanations for the observed fast water
flow, two additional sets of experiments were carried out. First, using devices
with N = 5 and 30, we increased RH outside the container up to 50% and >70%,
respectively, using increments of 20%. Surprisingly, no changes in Q could be
detected. At even higher external RH, the evaporation completely stopped (at ~90%
for N = 30), which is attributed to condensation at the output side of the channels.
This shows that it was not necessary to maintain RH accurately at zero and confirms
once again that it was not the differential vapour pressure that drove the water flow.
Most importantly, these experiments indicate that water diffusion and evaporation
outside the capillaries was not a limiting factor in our permeation measurements.
Otherwise, the increase in external RH would significantly reduce Q.

In the second set of experiments, we applied an additional pressure of
1.3 ± 0.3 bar to the water column inside our containers. This pressure was chosen
to be close to the maximum pressure that our membranes could withstand. To
create such pressures while keeping the container weight below ~15 g (required for
precision gravimetry), a chosen amount of NaBH₄ was dissolved in water inside the
container, which resulted in a slow release of hydrogen (over several hours at room
temperature). The pressure build-up inside a closed container was monitored in
a separate experiment (without a graphene device) and quantitatively agreed with
the pressure expected from the chemical reaction. The extra pressure did not lead
to any discernible difference in Q. This unambiguously proves that P much higher
than 1 bar drove water through the capillaries, and our measurement accuracy of
~10% yields a lower bound estimate for P of 15 bar.

Molecular dynamics simulations. To understand the observed behaviour, we
used both non-equilibrium and equilibrium MD simulations (NEMD and EMD,
respectively). Water molecules were confined between two rigid graphene sheets
of approximately 5 × 5 nm² in size and separated by h = aN (Extended Data
Fig. 7b). Unless specifically mentioned below, we used the SPC/E model for water,
and the carbon atoms were modelled as fixed neutral particles interacting with
oxygen through the Lennard-Jones (LJ) potential with the standard values of
the interaction parameters, ε_CO and σ_CO. For consistency, in all the presented
simulations we used ε_CO = 0.0927 kcal/mol and σ_CO = 3.283 Å, and LJ interactions were
truncated using a cut-off of 10 Å. The temperature of water was maintained at 300 K
using the Berendsen thermostat. Long-range Coulomb forces were computed using
the particle-particle particle-mesh method, and all the simulations were carried
out in the canonical ensemble using LAMMPS. The graphene capillary shown
in Extended Data Fig. 7b was initially connected to two reservoirs that contained
5,000 water molecules each. A pressure of 1 bar was applied to the water reservoirs
to ensure equal pressure on water molecules inside capillaries of different h. Then
the reservoirs were removed and periodic boundary conditions were applied in
all three directions.
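The carbon-oxygen interaction described above can be illustrated with a minimal Python sketch of a truncated 12-6 Lennard-Jones potential. The ε and σ values are those quoted in the text; the cut-off unit (ångström) and the use of plain truncation without an energy shift are assumptions made here for illustration, and the function name is hypothetical rather than part of LAMMPS.

```python
# Parameters quoted in the text for the carbon-oxygen LJ interaction.
EPS_CO = 0.0927    # kcal/mol
SIGMA_CO = 3.283   # angstrom
R_CUT = 10.0       # angstrom; cut-off distance (unit assumed here)

def lj_energy(r, eps=EPS_CO, sigma=SIGMA_CO, r_cut=R_CUT):
    """Truncated 12-6 Lennard-Jones energy (kcal/mol) at separation r (angstrom)."""
    if r <= 0:
        raise ValueError("separation must be positive")
    if r >= r_cut:
        return 0.0  # plain truncation: no interaction beyond the cut-off
    sr6 = (sigma / r) ** 6
    return 4.0 * eps * (sr6 * sr6 - sr6)

# The minimum of the 12-6 potential sits at 2^(1/6) * sigma with depth -eps.
r_min = 2.0 ** (1.0 / 6.0) * SIGMA_CO
print(f"r_min = {r_min:.3f} A, U(r_min) = {lj_energy(r_min):.4f} kcal/mol")
```

The potential crosses zero at r = σ and reaches its minimum of -ε slightly further out, which sets the preferred distance between water oxygens and the graphene walls.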

In our NEMD analysis, the flow was generated by applying a constant unidirectional
acceleration of 10¹² m s⁻² to all atoms in water, which corresponds to a
pressure gradient of ~10¹⁵ Pa m⁻¹. Such large gradients are standard for NEMD
simulations and necessary to obtain statistically significant results. The steady
flow state was achieved after ~1 ns, and the data were collected for >10 ns to find
the streaming velocity V. The flux was calculated as Q = ρwhV, where ρ is the
average density of the nanoconfined water and w the capillary width perpendicular
to the flow direction. For the known Q, the slip length δ can be found using
equation (1). Our NEMD results are presented in Extended Data Fig. 7c.
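The NEMD bookkeeping above can be sketched in a few lines of Python. Equation (1) is not reproduced in this section, so the sketch assumes the standard form for slit Poiseuille flow with slip at both walls, Q = ρwh³/(12η) × (1 + 6δ/h) × ∇P. The acceleration is taken as 10¹² m s⁻² (our reading, chosen so that ρa matches the ~10¹⁵ Pa m⁻¹ gradient), and the density, viscosity, layer spacing a = 0.34 nm and the synthetic flux are illustrative assumptions, not simulation output.

```python
RHO = 1000.0   # kg/m^3; bulk-water density used as a stand-in (assumption)
ACCEL = 1e12   # m/s^2; acceleration applied to water atoms (our reading of the text)
ETA = 0.7e-3   # Pa s; illustrative viscosity, not a fitted value

def pressure_gradient(rho, accel):
    """A uniform body force rho*a acts like a pressure gradient (Pa/m)."""
    return rho * accel

def mass_flux(rho, w, h, v):
    """Q = rho * w * h * V (kg/s) from the streaming velocity V."""
    return rho * w * h * v

def flux_with_slip(rho, w, h, eta, grad_p, delta):
    """Assumed form of equation (1): slit flow with slip length delta."""
    return rho * w * h ** 3 / (12.0 * eta) * (1.0 + 6.0 * delta / h) * grad_p

def slip_length(q, rho, w, h, eta, grad_p):
    """Invert the assumed equation (1) for delta, given a known flux Q."""
    q_no_slip = rho * w * h ** 3 / (12.0 * eta) * grad_p
    return (q / q_no_slip - 1.0) * h / 6.0

grad_p = pressure_gradient(RHO, ACCEL)
w, h = 5e-9, 4 * 0.34e-9   # 5 nm wide channel with N = 4 (a = 0.34 nm assumed)
q = flux_with_slip(RHO, w, h, ETA, grad_p, delta=60e-9)   # synthetic "measurement"
delta = slip_length(q, RHO, w, h, ETA, grad_p)
print(f"grad P = {grad_p:.1e} Pa/m, recovered delta = {delta * 1e9:.1f} nm")
```

The round trip (build a flux from a chosen δ, then recover δ) only checks that the inversion is self-consistent; in practice Q would come from `mass_flux` evaluated with the simulated streaming velocity.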

We also used EMD simulations to find δ = η/λ, which is given by the ratio of
the shear viscosity η to the liquid-solid friction coefficient λ. Both η and λ were
calculated through the Green-Kubo formalism using the simulated local structure
of confined water. We found that η was in the range of (0.5-0.9) × 10⁻³ Pa s
and λ was about 10⁴ kg m⁻² s⁻¹, in agreement with the previous simulations for
the water-graphite interface. This yielded δ ≈ 53 ± 8 nm for N ranging from
2 to 30 (Extended Data Fig. 7c). Note that CNTs are known to exhibit a strong
dependence of δ on their diameter, which is attributed to the effect of curvature.
No h dependence was found for planar graphene channels either in our simulations
or previously. For example, Falk et al. reported δ ≈ 80 nm for h ranging
from 0.4 nm to 4 nm, and Kannam et al. found δ ≈ 60 ± 6 nm for h = 4 nm. The
relatively minor discrepancies can be attributed to details of MD simulations such
as different interaction parameters and different thermostats.
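As a quick consistency check on the EMD estimate, the slip length follows directly from the ratio δ = η/λ. The short Python sketch below takes the viscosity range as (0.5-0.9) × 10⁻³ Pa s and the friction coefficient as about 10⁴ kg m⁻² s⁻¹ (our reading of the quoted orders of magnitude); the function name is hypothetical.

```python
LAMBDA = 1.0e4   # kg m^-2 s^-1; liquid-solid friction coefficient (order of magnitude)

def slip_length_nm(eta_pa_s, friction_coeff=LAMBDA):
    """Slip length delta = eta / lambda, returned in nanometres."""
    return eta_pa_s / friction_coeff * 1e9

# Scan the viscosity range to see the spread it implies for delta.
for eta in (0.5e-3, 0.7e-3, 0.9e-3):
    print(f"eta = {eta:.1e} Pa s -> delta = {slip_length_nm(eta):.0f} nm")
```

With these inputs the slip length comes out at several tens of nanometres, the same scale as the values reported for planar graphene and graphite interfaces.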

Despite usual quantitative differences between NEMD and EMD simulations
(Extended Data Fig. 7c), both show qualitatively the same behaviour with a
rapid decrease in water flow with decreasing N (approximately ∝ h², as expected
from equation (1) for a constant pressure P) and without any anomalies at small N.
This is in agreement with the previously reported simulations for flat graphene
capillaries. We also tried other models for water (TIP4P/2005) and its
interaction with graphene as well as the use of a flexible graphene confinement.
However, if a pressure P was assumed independent of h, we found it impossible to
obtain a peak in permeation at small N.

Capillary pressure. To analyse changes in the capillary pressure P with decreasing
N, we used the MD set-up shown in Extended Data Fig. 8a. As discussed in
the main text, the internal meniscus was chosen as an approximation for the
evaporating extended meniscus sketched in Extended Data Fig. 7a. Both have
approximately the same height h determined by N and involve the same interactions
of water molecules with graphite. Note that the extended meniscus is driven by
interactions with one graphite surface, which should result in a somewhat smaller
Π with respect to the modelled internal meniscus. This should lead to better
agreement with our experimental results but possible corrections are neglected
below. Water was supplied into the graphene capillaries from a relatively large
reservoir placed on the left. The reservoir was terminated with a rigid graphene
sheet that was allowed to move freely from left to right. Capillary pressure sucked
water inside the channel and forced the sheet to move to the right. We applied a
compensating force in the opposite direction to keep the sheet stationary. From
the found force and the known cross-sectional area of the channel, the pressure
P was calculated.

The results are shown in Extended Data Fig. 8b (solid symbols). The simulated
capillary pressure rises notably faster than that expected from the classical term due
to the curved meniscus (red curve). The steeper increase in P can be understood
as due to the disjoining pressure Π that consists of several contributions,
including the vdW pressure Π_vdW and entropic terms. The latter appear because
of different densities of water inside and outside graphene nanocapillaries as
well as the enhanced structural order in nanoconfined water. In our
case, changes in ρ are relatively minor (inset of Extended Data Fig. 8b), leading
to the corresponding entropic pressure of <50 bar (magenta curve). Also,
Π_vdW = A/(6πh³) presents a relatively small effect, where A is the Hamaker constant
for water-graphite interaction. The Π_vdW contribution becomes notable only
for N < 3 because of the rapid h⁻³ dependence (blue curve). The total of the
above three contributions is shown in Extended Data Fig. 8b by the green dashed
curve. The remaining difference with respect to the MD-simulated dependence
can be attributed to the entropic pressure due to the increased structural order in
nanoconfined water.
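To give a feel for the two analytic terms discussed above, the following Python sketch evaluates the classical meniscus term P0 = 2σcos(θ)/h and the vdW term Π_vdW = A/(6πh³) for a few channel heights h = aN. The contact angle (80°) and Hamaker constant (115 zJ) are taken from our reading of the caption of Extended Data Fig. 8; the surface tension of bulk water (0.072 N m⁻¹) and the layer spacing a = 0.34 nm are assumptions introduced here, and the entropic terms are not modelled.

```python
import math

A_LAYER = 0.34e-9            # m; graphene interlayer spacing (assumed)
SIGMA = 0.072                # N/m; surface tension of bulk water (assumed)
THETA = math.radians(80.0)   # contact angle, as read from the figure caption
HAMAKER = 115e-21            # J; Hamaker constant (115 zJ), as read from the caption

def meniscus_pressure(h):
    """Classical capillary term P0 = 2*sigma*cos(theta)/h, in Pa."""
    return 2.0 * SIGMA * math.cos(THETA) / h

def vdw_pressure(h):
    """van der Waals disjoining term A/(6*pi*h^3), in Pa."""
    return HAMAKER / (6.0 * math.pi * h ** 3)

for n in (2, 3, 5, 10):
    h = n * A_LAYER
    print(f"N = {n:2d}: P0 = {meniscus_pressure(h) / 1e5:6.0f} bar, "
          f"Pi_vdW = {vdw_pressure(h) / 1e5:6.1f} bar")
```

Because the vdW term falls off as h⁻³ while the meniscus term falls off only as h⁻¹, doubling h reduces the former eightfold and the latter twofold, which is why the vdW contribution matters only for the thinnest channels.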

Intrinsic collapse of monolayer capillaries. To understand the complete blockage
observed for all our devices with N = 1, we performed the following MD
simulations. Graphene capillaries were modelled as flexible graphene layers
stacked on top of each other with the interlayer distance a. One or two graphene
layers were partially removed in the middle to create channels of 20 nm in width
(Extended Data Fig. 9). We found that the walls of monolayer channels sagged
already after several ps, independent of the thickness of graphite walls. In stark
contrast, bilayer channels remained open. This behaviour is attributed to vdW
attraction between capillary walls, which is sufficiently strong at short distances
to deform the graphite bulk but rapidly vanishes with increasing separation.
Extended Data Figure 1 | Microfabrication process flow. (1) A
micrometre-scale hole is prepared in a silicon nitride membrane.
(2) Bottom graphite is transferred to cover the opening. (3) An array
of graphene spacers is transferred on top. (4) The hole is extended into
the graphite-graphene stack by dry etching. (5) Top graphite crystal is
transferred to cover the resulting aperture. The accompanying optical
images (in natural colours) illustrate the results after each step for one of
our devices. Graphene spacers are invisible in the photos and indicated by
an opaque rectangle in (3). Steps 3 and 4 were often interchanged.
Extended Data Figure 2 | Additional images of graphene capillaries.
a, Optical image of a final device. The green region is the free-standing
silicon nitride membrane. The Si wafer is seen in brown and the top
graphite crystal (arrowed) in yellow. Red, yellow and grey contours
indicate positions of the top graphite, bottom graphite and graphene
spacers, respectively. The nearly-vertical dark lines are wrinkles in the
bottom layer. b, AFM image of four-layer graphene spacers on top of a
bottom graphite crystal (height profile along the dashed line is shown
below the image). Inset, high-resolution scan (friction mode) from the
region indicated by the arrow. The observation of the atomic lattice
confirms that our assemblies have atomically smooth surfaces. Such
smoothness is impossible to achieve using conventional materials and
processes that invariably lead to the surface roughness exceeding the
scale given by few-layer graphene spacers. Although the side walls of our
channels are rough due to limitations of electron-beam lithography, we
estimate that, because of the large ratios w/h, the side wall contribution
to the flow resistance cannot exceed 5% even for our 10 nm devices.
c, SEM micrograph of a capillary device with h ≈ 15 nm. d, Bright field
STEM image of a graphene capillary with N = 4. Spacers and channels are
arrowed in c and d.
Extended Data Figure 3 | Sagging of top graphite. a, Left, AFM image
of trilayer channels, which are covered by a graphite layer of varying
thickness. b, Left, partial sagging of the top graphite into wide channels.
We can see that the top graphite bends down into the channels over their
entire height h ≈ 5 nm. Right, height profiles that correspond to the traces
shown by the dashed lines in the AFM images at left.
Extended Data Figure 4 | He leak through graphene capillaries.
a, Schematic of our set-up. Two vacuum chambers are separated by the
silicon nitride wafer incorporating a nanocapillary device. Valves connect
the chambers to a pump, a He leak detector and a gas inlet. He atoms are
represented by filled orange circles. b, Leak rates normalized for 1 µm
length and given per channel as a function of applied pressure for capillary
devices with N = 5 and N = 45 (h = 1.7 nm and 15 nm, respectively), and a
control device without graphene spacers (N = 0).
Extended Data Figure 5 | Ion transport through graphene
nanochannels. a, Schematic of our measurement set-up. A nanocapillary
device fabricated on top of a Si nitride wafer (SiN is shown in green) is
clamped using O-rings (black) to separate two containers (indicated by
magenta lines). The containers are filled with a KCl solution (blue), and
silver chloride-silver wires (dark grey) are used as electrodes to measure
ionic conductance. b, Examples of current-voltage (I-V) characteristics of
the smallest capillary devices (N = 2) at different KCl concentrations
(labelling the curves; L ranges from 2.8 µm to 7 µm). c, Same as b but for a
device with N = 17 of approximately the same average length L (L from
1.7 µm to 7.3 µm). d, Ionic conductance for these devices as a function of
KCl concentration, C (without normalizing for their slightly different L).
Both blank Si nitride wafers separating the reservoirs and control devices
with N = 0 (no spacers but otherwise prepared using the same fabrication
procedures) exhibited leakage conductance of the order of 20 pS, which
did not change with C (olive symbols). The dashed lines show ionic
conductance G expected from the bulk conductivity of KCl for the given
channel dimensions. The solid curves are fits taking into account an
additional parallel conductance due to the surface charge.
Extended Data Figure 6 | Gravimetric measurements. a, Extended
schematic of the experimental set-up. A small aluminium container filled
with water is sealed with a Si nitride wafer containing a graphene capillary
device (total weight should not exceed ~15 g to allow the required
measurement accuracy). The container was weighed either upside
down (water in contact with capillaries as shown in the sketch) or in the
upright position as shown in the inset of Fig. 2a (capillaries are exposed
to 100% RH). Both orientations resulted in the same Q. b, Photographs
of our gravimetric set-up. Main image, microbalance with our miniature
container being weighed (its position is indicated by a dashed square).
Image to the right, the container is open and the Si nitride wafer (that
is clamped between the O-rings during measurements) is removed.
c, Examples of water evaporation through apertures of different diameters,
D (colour coded). d, Dependence of the evaporation rate on D (error bars,
s.d.). Red line, best linear fit. Inset, optical micrograph (natural colour) of
an aperture of 30 µm diameter, which is etched in a Si nitride membrane.
Extended Data Figure 7 | Molecular dynamics simulations of water
flow through graphene slits. a, Our capillaries are filled with water
and the driving pressure is determined by evaporation of the extended
meniscus that appears at the capillary mouth. The meniscus is sketched in
the drawing, showing a thin film of water propagating along the graphite
surface. b, MD set-up with the simulation box indicated by the black
lines. The particular snapshot is for N = 4. Dark grey balls represent
carbon atoms arranged into graphene planes. Red and light grey spheres
show oxygen and hydrogen atoms of water molecules. c, Simulated slip
length and water flux Q as a function of N. In the case of NEMD,
δ (blue symbols) was calculated from the simulated Q (blue bars) using
equation (1). Using δ found from the EMD simulations (grey symbols) and
the pressure gradient of 10¹⁵ Pa m⁻¹, equation (1) yields Q shown by the
grey bars.
Extended Data Figure 8 | MD simulations of capillary pressure.
a, Our MD set-up for N = 4. Colour coding as in Extended Data
Fig. 7b. Graphene planes to the right represent a graphite crystal with four
atomic planes removed. The vertical graphene plane is used as a movable
membrane to apply a compensating force to stop the water meniscus from
propagating to the right. b, Main figure, simulated capillary pressures
(symbols with s.d. error bars). The red curve shows the best fit for large
N using P0 = 2σcos(θ)/h, which yields θ = 80°. Blue and magenta curves
show Π_vdW with the Hamaker constant A ≈ 115 zJ (ref. 53), and the
entropic pressure due to changes in ρ, respectively. Dashed green curve,
combined pressure from the three contributions. Inset, simulated density
of water confined between graphene sheets under external pressure of
1 bar.
Extended Data Figure 9 | Micromechanical stability of graphene
cavities. Shown are snapshots of mono- and bilayer capillaries
(left and right columns, respectively) after 100 ps of MD simulations.
a-d, Capillaries with different thicknesses of graphite walls, respectively 2,
6, 20 and 40 graphene layers.
Evolution of global temperature over the past two million years

Carolyn W. Snyder


Reconstructions of Earth's past climate strongly influence our 
understanding of the dynamics and sensitivity of the climate 
system. Yet global temperature has been reconstructed for only a few 
isolated windows of time, and continuous reconstructions across
glacial cycles remain elusive. Here I present a spatially weighted 
proxy reconstruction of global temperature over the past 2 million 
years estimated from a multi-proxy database of over 20,000 sea 
surface temperature point reconstructions. Global temperature 
gradually cooled until roughly 1.2 million years ago and cooling 
then stalled until the present. The cooling trend probably stalled
before the beginning of the mid-Pleistocene transition, and pre-dated
the increase in the maximum size of ice sheets around 0.9
million years ago. Thus, global cooling may have been a pre-condition
for, but probably is not the sole causal mechanism of,
the shift to quasi-100,000-year glacial cycles at the mid-Pleistocene 
transition. Over the past 800,000 years, polar amplification 
(the amplification of temperature change at the poles relative to 
global temperature change) has been stable over time, and global 
temperature and atmospheric greenhouse gas concentrations have 
been closely coupled across glacial cycles. A comparison of the new 
temperature reconstruction with radiative forcing from greenhouse 
gases estimates an Earth system sensitivity of 9 degrees Celsius 
(range 7 to 13 degrees Celsius, 95 per cent credible interval) change 
in global average surface temperature per doubling of atmospheric 
carbon dioxide over millennium timescales. This result suggests that 
stabilization at today’s greenhouse gas levels may already commit 
Earth to an eventual total warming of 5 degrees Celsius (range 3 to 
7 degrees Celsius, 95 per cent credible interval) over the next few 
millennia as ice sheets, vegetation and atmospheric dust continue 
to respond to global warming. 

Reconstructions of several key climate variables are available with 
high temporal resolution across past glacial cycles, such as polar 
temperature, atmospheric greenhouse gas (GHG) concentrations, sea 
surface temperature (SST), deep-water temperature (DWT) and sea 
level (see, for example, Extended Data Tables 1-3). Yet global average 
surface temperature (GAST) has been reconstructed for only a few 
isolated windows of time, and continuous reconstructions across
glacial cycles remain elusive. The lack of continuous GAST reconstructions
has constrained model-data comparisons to particular extreme
points in time, such as the Last Glacial Maximum (LGM), but multiple
time points are critical for characterizing the uncertainty in relationships
estimated from palaeoclimate reconstructions. The potential
power of a continuous GAST record has been recently demonstrated:
GAST was reconstructed for the past 22,000 years (kyr) and used to
clarify carbon dioxide's role in driving global climate change across
glacial cycles. The present research creates a continuous record of
GAST across a much longer timescale.

Previous continuous reconstructions of GAST across glacial cycles
used only a single proxy record that was scaled linearly or
modelled to estimate global values. This Letter presents a spatially
weighted proxy reconstruction of GAST over the past 2 million years
(Fig. 1a), estimated using a multi-proxy database compilation of over
20,000 SST point reconstructions from 59 ocean sediment cores
(Extended Data Tables 1, 2). This research uses probabilistic simulations
across multiple sources of uncertainty to estimate credible intervals at
1-kyr intervals, and validates the new reconstruction against previous
estimates. The new GAST reconstruction can provide key insights into
several major palaeoclimate questions, including the magnitude and
stability of polar amplification, the state dependence of Earth system
sensitivity (ESS, see below), and the role of global temperature in
the mid-Pleistocene transition (MPT).

A comparison of GAST to Antarctic temperature reconstructions
for the past 800 kyr finds that GAST and Antarctic temperature are
closely coupled across glacial cycles with a correlation of 0.72 (0.59-0.81,
95% credible interval, hereafter 'interval'), a high correlation
given that the GAST reconstruction is estimated independently of the
ice core records. There is a linear relationship of 0.61 °C (0.43-0.85 °C,
95% interval) change in GAST for every 1 °C change in Antarctic
temperature (Fig. 2a) that does not significantly change over the past
800 kyr (Extended Data Fig. 6a). Some previous research on climate
sensitivity over the past 800 kyr has assumed that changes in GAST are
similar to half the magnitude of changes in Antarctic temperature.
On the basis of the new GAST reconstruction, there is an 87%
probability that such an assumption underestimates global temperature
and thus climate sensitivity.

Figure 1 | Reconstruction of global average surface temperature (GAST)
over the past 2 million years compared to other key palaeoclimate
variables. a, GAST as temperature deviation (in °C) from present (average
over 0-5 ka) in blue. b, Stacked reconstruction of change in Antarctic
temperature (°C) in cyan. c, Stacked reconstruction of atmospheric CO2
concentrations (p.p.m.) in red. d, Stack of deep-sea oxygen isotopes,
δ¹⁸O (‰), in grey. In all panels, the solid black lines show the median
estimate and the colour shaded areas show the 95% interval.

Figure 2 | Relationship of changes in GAST to changes in Antarctic
temperature and GHG radiative forcing over the past 800 kyr.
a, b, Each point represents randomly sampled estimates from simulations
of GAST plotted against Antarctic temperature (a) and GHG radiative
forcing (b) over the past 800 kyr. The dashed black line shows
the median estimated relationship in °C per °C in a and in °C per W m⁻²
in b. The red dashed line shows the median estimated quadratic
relationship in b.

Polar amplification can be estimated as change in Antarctic temperature
for every 1 °C change in GAST, here estimated as 1.6 °C per °C
(1.2-2.3 °C per °C, 95% interval). Estimates of polar amplification
are complicated because the elevation of ice sheets changes during
glacial cycles due to changes in accumulation and isostasy, and
current ice sheet glaciological models disagree markedly with the ice
topography used in LGM model simulations for Antarctica. Climate
models estimate that polar amplification of uncorrected Antarctic
temperature will be nonlinear and higher in colder states, ~2 °C per °C
for the LGM and ~1.2 °C per °C for future warming, but ~1.2 °C
per °C for both when Antarctic temperature is elevation-corrected.
The present research uses a reconstruction of Antarctic temperature
that is elevation-corrected, but corrections are highly dependent upon
uncertain ice sheet assumptions. A comparison of the new GAST
reconstruction with Antarctic temperature finds that a quadratic relationship
is not significant, as the models predict (Fig. 2a), suggesting
the elevation correction of Antarctic temperature is adequate. However,
the magnitude of the polar amplification estimate from the new GAST
reconstruction is significantly higher than predicted by many models
(97.5% probability above 1.2). It is worth noting that those same
models underestimate elevation-corrected Antarctic temperature
change at the LGM by a factor of 1.2-2.3 (ref. 16).
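The polar amplification figures quoted above are numerically close to the reciprocal of the GAST-per-Antarctic slope reported earlier (0.61 °C per °C, interval 0.43-0.85). The Python sketch below simply inverts that slope and its interval bounds; it is an arithmetic illustration, not the estimation procedure used for the reconstruction, and the function name is hypothetical.

```python
def invert_slope(median, low, high):
    """Invert a positive slope and its interval; the bounds swap under 1/x."""
    return 1.0 / median, 1.0 / high, 1.0 / low

# GAST change per degC of Antarctic change, as quoted in the text.
amp, amp_low, amp_high = invert_slope(0.61, 0.43, 0.85)
print(f"{amp:.1f} ({amp_low:.1f}-{amp_high:.1f}) degC Antarctic per degC GAST")
# prints "1.6 (1.2-2.3) degC Antarctic per degC GAST"
```

Note that an ordinary least-squares slope of y on x is not in general the reciprocal of the slope of x on y, so this agreement should be read as a consistency check on the quoted numbers only.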

Other research has assumed that changes in DWT can be used as a
direct proxy for GAST or doubled to estimate GAST. A comparison
between GAST and 12 DWT reconstructions from three different
methods finds highly variable results, with median correlations
varying between 0.3 and 0.8 and median linear relationships varying
between 1.4 °C and 3.5 °C change in GAST per 1 °C change in DWT
(Extended Data Table 3). The observed attenuation of the global temperature
signal and the reduced correlation may be caused by deep-water
cooling being limited by water's freezing temperature and/or
by changes in ocean circulation. These results demonstrate
the high uncertainty in inferring GAST from any single DWT
reconstruction.

High-resolution estimates of atmospheric GHG concentrations are
also available from Antarctic ice core records over the past 800 kyr.
The GAST reconstruction reveals a remarkably stable relationship
between GAST and GHG radiative forcing over the past 800 kyr
with a correlation of 0.82 (0.66-0.92, 95% interval), stronger than
the correlation between GAST and Antarctic temperature (Fig. 2b).
The concept of S[GHG] has been defined as the total global climate
response over millennial timescales from changes in ice sheets, dust
and vegetation, as well as from the feedbacks included in 'equilibrium
climate sensitivity' (water vapour, lapse rate, sea ice, snow cover, clouds
and ocean heat uptake), but does not include carbon cycle feedbacks.
A comparison of the new GAST reconstruction with GHG radiative
forcing estimates S[GHG] as 2.5 °C (1.8-3.6 °C, 95% interval) change in
GAST per 1 W m⁻² change in GHG radiative forcing (Fig. 2b), and
finds that the relationship does not change significantly over the past
800 kyr (Extended Data Fig. 6b). This S[GHG] estimate translates to a 9 °C
(7-13 °C, 95% interval) change in GAST per doubling of atmospheric
carbon dioxide (3.7 W m⁻²), which has often been called ESS.
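The conversion from sensitivity per unit forcing to warming per doubling of carbon dioxide is a single multiplication by the doubling forcing of 3.7 W m⁻². A minimal Python sketch of that arithmetic, using only the values quoted above (the function name is hypothetical):

```python
F_2XCO2 = 3.7  # W m^-2 per doubling of CO2, as quoted in the text

def ess_from_sensitivity(s_ghg, forcing=F_2XCO2):
    """Warming per CO2 doubling (degC) from sensitivity in degC per W m^-2."""
    return s_ghg * forcing

# Median and 95% interval bounds of the sensitivity quoted in the text.
for label, s in (("median", 2.5), ("lower", 1.8), ("upper", 3.6)):
    print(f"{label}: {ess_from_sensitivity(s):.2f} degC per CO2 doubling")
```

Rounded to whole degrees, the three products reproduce the quoted 9 °C (7-13 °C) estimate.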

Attenuation of the S[GHG] relationship is apparent in deep glacial
states and a quadratic relationship is found to be a significantly better
fit than a linear relationship (Fig. 2b). However, it is unclear whether
such a quadratic relationship would apply in warmer states: when
the bottom quarter of the record is removed, a quadratic relationship
is not significant. Previous research also found S[GHG] to be climate
state dependent, and most studies find higher values for the late
Quaternary than for the Pliocene². The presence of large ice sheets in
the Quaternary is probably a major cause. Yet little research has focused
on the potential variation of S[GHG] within the late Quaternary.
Masson-Delmotte et al. also found attenuation within the late Quaternary in
deep glacial states, estimating a parabolic relationship. The observed
attenuation of late Quaternary S[GHG] seems to suggest there is a limit
to the power of positive climate feedbacks, such as from sea ice and ice
sheets, as ice sheet size increases in deep glacial states.

Because S[GHG] and ESS are climate state dependent, it is most useful
to compare this result to other estimates from the late Quaternary.
Rohling et al. found a similar ESS estimate of 8.5 °C, but did not
include a comparable probabilistic analysis in their estimate. Two studies
by Hansen et al. both estimated an ESS of 6 °C, assuming GAST is half the
Antarctic temperature change or twice the DWT change, respectively.
The present research finds that there is a 99% probability that ESS for
the late Quaternary is higher than 6 °C.

The new GAST reconstruction also can provide insight into the MPT.
The causes of the MPT, when the Earth's climate shifted from glacial
cycles with periods of about 41 kyr to those with quasi-100-kyr periods,
are not well understood and debates continue on the potential linkage
between different orbital changes and the quasi-100-kyr cycles.
Some theories explain the MPT with changes in nonlinear feedbacks
internal to the climate system, such as changes to ice sheets, sea ice
or ocean circulation. An alternative theory is the erosion of
continental regolith underneath the ice sheets enabling the growth of
thicker ice sheets.

Probabilistic breakpoint analysis is used to identify any changes in
cooling trends across the past 2 Myr in the new GAST reconstruction,
as well as the timing of the cooling trend changes. Such analysis finds a
strong cooling trend after 2 Myr ago (Ma) that then stops most probably
at 1.2 Ma (median estimate), with a 72% probability that GAST cooling
stopped by 1.1 Ma and 77% by 0.9 Ma (Fig. 3a). The timing of when
the global cooling trend stops roughly corresponds to estimates of
the beginning of the broad MPT, which is estimated to occur over the
general period of 1.25 to 0.7 Ma based on spectral analysis of oxygen
isotopes. Before roughly 1.2 Ma, global temperature cooled gradually
by approximately 0.34 °C (0.16-0.62 °C, 95% interval) per 100 kyr
(Fig. 3b). However, since 1.2 Ma, GAST stabilized with no significant
change in global temperature, 0.007 °C (−0.12 to 0.42 °C, 95% interval)
per 100 kyr (Fig. 3c). From 1.2 to 0.5 Ma, the behaviour of GAST
across glacial-interglacial cycles gradually shifted to quasi-100-kyr
cycles with larger amplitudes of change, as seen in Fig. 1a. Although
average GAST did not continue to cool after roughly 1.2 Ma, GAST
did show a particularly large amplitude for the glacial cycle at 0.9 Ma
of 7 °C (4-10 °C, 95% interval), which is similar in magnitude to more
recent, post-MPT glacial cycles. These findings of gradual cooling
probably pre-dating the MPT and a gradual shift to quasi-100-kyr
cycles are consistent with some previous SST and DWT research.
The global cooling trend also is synchronous with the development of
the equatorial Pacific cold tongue and bipolar cooling estimated from
~1.8 Ma to ~1.2 Ma (ref. 29). However, GAST does not exhibit the
intensified cooling across the MPT seen in some individual SST records
and predicted by some MPT theories.

Figure 3 | Probabilistic breakpoint analysis of global temperature
trends over the past 2 million years. Shown are empirically fitted
frequency distributions for the timing of when the trend in global
temperature changes (a; in black), the global temperature trend before
the breakpoint (b; in blue), and the global temperature trend after the
breakpoint (c; in green).
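The idea behind a breakpoint analysis of a temperature trend can be sketched with a deliberately simplified, non-probabilistic stand-in: a grid search over candidate breakpoints, fitting a separate least-squares line on each side and keeping the split with the smallest total residual. This is not the probabilistic method used for the reconstruction. The synthetic series below is an assumption built to mimic the quoted behaviour (cooling of 0.34 °C per 100 kyr until 1.2 Ma, flat afterwards), and all function names are hypothetical.

```python
import math

def ols(xs, ys):
    """Ordinary least-squares slope, intercept and residual sum of squares."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sxx
    intercept = my - slope * mx
    sse = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    return slope, intercept, sse

def best_breakpoint(ages, temps, min_points=5):
    """Return (breakpoint age, younger-side slope, older-side slope)."""
    best = None
    for i in range(min_points, len(ages) - min_points + 1):
        s_young, _, e_young = ols(ages[:i], temps[:i])
        s_old, _, e_old = ols(ages[i:], temps[i:])
        if best is None or e_young + e_old < best[0]:
            best = (e_young + e_old, ages[i], s_young, s_old)
    return best[1:]

# Synthetic anomaly series: ages in ka (0 = present), warmer before 1,200 ka,
# plus a small deterministic wiggle standing in for noise.
ages = [50.0 * k for k in range(41)]   # 0 to 2,000 ka
temps = [0.0034 * max(a - 1200.0, 0.0) + 0.01 * math.sin(0.7 * a) for a in ages]

bp, slope_young, slope_old = best_breakpoint(ages, temps)
print(f"breakpoint near {bp:.0f} ka; cooling before it: "
      f"{slope_old * 100:.2f} degC per 100 kyr; after it: {slope_young * 100:.2f}")
```

A probabilistic version would repeat such a fit over many sampled realizations of the reconstruction and report the distribution of breakpoints and slopes, which is the kind of output summarized in Fig. 3.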

Several MPT theories employ increases in ice sheet size to explain
the change in nonlinear climate feedbacks at the MPT, and hypothesize
that global cooling could be the causal mechanism for such ice sheet
growth. Analyses of orbital responses across the MPT similarly
suggested that global cooling could have enabled the skipping of
obliquity cycles. The present research provides evidence of such
global cooling before the MPT. However, the global cooling probably
pre-dates the rapid ice sheet growth observed at the deep glacial period
around 0.9 Ma (refs 4-6) and the development of the first quasi-100-kyr
cycle by 300 kyr ago (ka). Thus, either an additional explanation is required
for the lag after global cooling before MPT climate changes, or
the MPT changes may have been caused by a mechanism not linked to
global temperature, such as erosion of continental regolith or orbital
changes without internal climate changes.


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.
METHODS

GAST reconstruction. Overall approach. This research estimates GAST from local
SST proxy-based reconstructions through five steps. First, I collect SST proxy-based
reconstructions and estimate proxy uncertainty from a literature review.
Second, I interpolate the SST reconstructions to common 1-kyr intervals and estimate
dating uncertainty. Third, I estimate average SST values for latitudinal zones
using a variety of possible spatial weighting schemes. Fourth, I analyse Paleoclimate
Modelling Intercomparison Project (PMIP) model simulations to obtain an estimate
of the relationship between average SST over latitudinal zones and GAST. I
use the estimated scalar to linearly scale the average changes in SST for latitudinal
zones to changes in GAST. Last, I use several approaches to test the validity and
sensitivity of this approach.

The biggest challenge in this research is that the primary continuous temperature 
reconstructions available over the past 2 Myr are mostly from SST proxy 
records. Available terrestrial temperature reconstructions are too infrequent 
and limited in spatial distribution to be used for a global reconstruction at this 
time. Thus, this research develops a method to estimate GAST from a collection 
of SST estimates. Such a scaling must address the fact that SST records do not 
adequately cover the entire Earth surface, and that by definition, SST records do 
not include records of temperature change over land. This is especially important 
given that temperature change is amplified over land and at the poles relative to 
the oceans. The goal of this research is to develop a method that is transparent in 
its assumptions and associated estimates of uncertainty and could be applied in 
the same way across the past 2 Myr, even though there are far fewer SST recon- 
structions available farther back in time. Because GAST estimates are often used 
to investigate questions related to climate sensitivity, the designed approach should 
not be dependent on any assumptions of climate sensitivity. 

To investigate potential approaches to this challenge, this research uses the 
PMIP global climate models"*** because the PMIP model experiments provide 
estimates of local SST and air surface temperature as well as GAST for both the 
LGM and the pre-industrial state, thus covering most of the range of temperatures 
over the past 2 Myr. Because the highest latitude SST reconstructions are at about 
60° N/S, I compared the ratio of change in GAST to the average change in SST in 
all SST grid cells 60° N-60° S. The estimated scaling factors are not 
correlated with other model features, such as climate sensitivity and LGM GAST 
(Extended Data Fig. 4). Hargreaves et al.® similarly find that LGM GAST is not 
correlated with the models' climate sensitivity values. This is very important, since 
a main goal of this approach is to develop a method that is not linked to a particular 
climate model or a particular estimate of climate sensitivity. Another approach 
would be to scale just from average tropical SST changes, but that would ignore 
valuable data from high latitude records, would be more uncertain due to the 
relatively larger contribution of the scalar value, and would be more model dependent. 

It is important to note that the concept of a regional average SST over latitudes 
60° N-60° S does not have direct physical meaning or relevance, as it is not a 
true regional average: it does not include the land area in latitudes 60° N-60° S 
and only averages over SST grid cells. Nor does the described scalar of change in 
GAST relative to change in SST over 60° N-60° S. However, taking a simple average 
over latitudinal bands is a stable and transparent way to summarize the available 
SST reconstructions. Moreover, there is no assumed functional relationship of 
SST with latitude or with GAST, except for the single coarse scaling metric. This 
method ensures that the full change in GAST at glacial maxima is captured in the 
final estimated reconstruction by using the LGM scalar from the PMIP models. 

Rohling et al.’ used a somewhat similar approach of a spatial average of SST 
to estimate global climate sensitivity. They estimated a quadratic relationship 
of change in SST over latitude from 36 SST reconstructions and integrated that 
function to calculate a global mean response of SST. They then adjusted for a 
stronger terrestrial response to scale to a larger estimate of global climate sensitivity, 
but they did not produce an estimate of GAST. 
SST proxy-based reconstructions. This analysis utilizes a multi-proxy database 
compilation of all available and reliable SST reconstructions that cover at least the 
past 100 kyr. The SST database includes 61 SST proxy reconstructions from 
59 ocean sediment cores: 29 using alkenone unsaturation indices (UK'37), 17 using 
ratios of Mg/Ca in planktonic foraminifera, and 16 based upon microfossil 
abundances (using transfer functions for planktonic foraminifera and radiolarians) 
(Extended Data Tables 1, 2; Extended Data Figs 1, 2; Supplementary Data). 
A multi-proxy approach enables a reduction of the uncertainties and potential 
biases specific to each proxy method by combining estimates from several 
independent proxies*'. 

Proxy methods have a variety of potential sources of error, including proxy 
measurement, seasonality, species dependence, productivity, water column 
depth, mixing, and dissolution and other post-depositional alteration. Estimates 
of the measurement and calibration errors are available from laboratory and 
field experiments for the different proxy methods (for example, from alkenone 
indices", Mg/Ca ratios’”* and species assemblages”) and typically range from 
1°C to 3°C for two standard deviations. However, the published uncertainty 
estimates often do not include considerations of structural uncertainty from the 
assumptions of the proxy method. Therefore, I use the upper range of the published 
values of 3°C (95% interval) as an estimate for the combined uncertainty for each 
of the SST proxy methods. 

Dating uncertainty and SST reconstruction interpolation. It is imperative that 
comparisons between palaeoclimate records include the uncertainty in matching which 
parts of each record occurred at the same point in time". To interpolate each SST 
reconstruction to a common 1-kyr timescale, I estimated a weighted average of 
the SST reconstruction for each time point on the 1-kyr timescale. The weights are 
based on the distance in time between the reconstruction value and the time point 
of interest. The bandwidth is set by the dating uncertainty for that time point. I use 
the published age scales for the SST reconstructions. I use the estimate of 10 kyr 
(95% interval) for dating uncertainty from orbital tuning™*"~®, unless papers 
provide specific estimates of uncertainty in their timescales. The uncertainty in the 
interpolated value is estimated from a weighted average of the squared 
differences between the reconstruction values and the estimated interpolated value. 
I implement this method using a Nadaraya-Watson kernel-weighted local constant 
regression, using the function ksmooth in the R statistical program 
(http://stat.ethz.ch/R-manual/R-patched/library/stats/html/ksmooth.html). This method results in 
larger SST uncertainty during periods of rapid change than during periods of stable 
SST, thus reflecting the varying potential impact of dating uncertainty. 
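
The interpolation step above can be illustrated with a minimal Python sketch of a Nadaraya-Watson local-constant estimator. This is not the R ksmooth call used in the paper: the Gaussian kernel, the function name nw_smooth, the bandwidth value and the sample record are all assumptions made for illustration.

```python
import math

def nw_smooth(ages, values, grid, bandwidth):
    """Nadaraya-Watson local-constant estimate on `grid`.

    Returns (estimates, variances): the kernel-weighted mean at each grid
    time and the kernel-weighted mean squared deviation around that mean.
    The Gaussian kernel and a single fixed bandwidth are assumptions.
    """
    estimates, variances = [], []
    for t in grid:
        # Weights depend only on the distance in time from the target point
        w = [math.exp(-0.5 * ((a - t) / bandwidth) ** 2) for a in ages]
        total = sum(w)
        mean = sum(wi * v for wi, v in zip(w, values)) / total
        var = sum(wi * (v - mean) ** 2 for wi, v in zip(w, values)) / total
        estimates.append(mean)
        variances.append(var)
    return estimates, variances

# Hypothetical record: ages in ka, SST in deg C (illustrative values only)
ages = [0.4, 1.7, 3.1, 4.8, 6.2, 7.9]
sst = [18.0, 18.2, 18.1, 17.6, 17.0, 16.5]
grid = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
est, var = nw_smooth(ages, sst, grid, bandwidth=2.5)
```

Because the variance term is a weighted average of squared deviations from the local estimate, it grows where the record changes quickly within the kernel window, which matches the behaviour described in the text.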

SST averages over a latitudinal zone. The spatial distribution of the available SST 
records is too sparse to use spatial statistics to estimate average SST. An alternative 
approach is to apply a simple assumption of a quadratic change in SST with 
latitude’, but such a method adds a large amount of uncertainty when there are 
fewer available reconstructions. The simplest approach would be a direct average 
of all SST reconstructions equally, but that ignores the known general amplification 
of change in SST with latitude and would make the estimate very dependent on 
the particular distribution of the available reconstructions. The proposed method 
is a middle ground: SST is first averaged across records within a single latitudinal 
zone and then the latitudinal zones are summed using applicable spatial weights. 
Because it is spatially averaged, it is not purely driven by high latitude records, 
and the reduction in uncertainty from multiple records in a given latitudinal zone 
is captured. 

To explore how the 60° N-60° S average SST estimate varies with different 
latitudinal zone boundaries, I use 9 different possible configurations of latitudinal 
zone boundaries, used with equal weights in the final ensemble: 

• Four zones: 

60° N-30° N, 30° N-0°, 0°-30° S, 30° S-60° S (equal degrees); 
60° N-25.7° N, 25.7° N-0°, 0°-25.7° S, 25.7° S-60° S (equal areas); 
60° N-20° N, 20° N-…; 
60° N-35° N, 35° N-…; 

• Six zones: 

60° N-40° N, 40° N-20° N, 20° N-0°, 0°-20° S, 20° S-40° S, 40° S-60° S (equal degrees); 
60° N-35.3° N, 35.3° N-16.8° N, 16.8° N-0°, 0°-16.8° S, 16.8° S-35.3° S, 35.3° S-60° S (equal areas); 
60° N-40° N, 40° N-15° N, 15° N-0°, 0°-15° S, 15° S-40° S, 40° S-60° S; 
60° N-35° N, 35° N-20° N, 20° N-0°, 0°-20° S, 20° S-35° S, 35° S-60° S; 
60° N-30° N, 30° N-15° N, 15° N-0°, 0°-15° S, 15° S-30° S, 30° S-60° S. 

The SST proxy records have limited geographical distribution, variable length 
and variable resolution. In particular, the records are clustered in space and have 
non-random spatial distribution (Extended Data Figs 1a, 2). Thus, I explore two 
approaches in analysing the records. In the first, I include all 61 SST proxy records 
and weight them equally. In the second approach, I identify locations where there 
is more than one record within a circle of radius 5° latitude/longitude. I identify 
11 such clusters that include 43 proxy records in total. For each cluster, I estimate 
the mean value over time and the variation across the cores. I plot the resulting 
29 records (11 clusters and 18 independent proxy records) for this 'clustered' 
analysis in Extended Data Fig. 1b. 

Because this research is focused on change in SST, not absolute values of SST, it 
is important that the SST reconstructions are normalized to change from present 
before they are summed within a latitudinal zone. The rest of this research defines 
present as the mean value 0-5 ka. However, in this particular instance, 8 of the SST 
reconstructions have their first estimate between 5 and 8 ka. Rather than assume 
an estimate for 5 ka, this research uses the mean value 0-10 ka when estimating 
the latitudinal zone averages. Once the weighted average of the zones is estimated, 
the deviation from present is then recalculated to be the mean value 0-5 ka to 
ensure consistency. 

This analysis uses Monte Carlo-style simulations to estimate several sources 
of uncertainty from the proxy reconstruction by adding random error to each 
reconstruction from the estimated proxy uncertainty and the estimated uncertainty 
introduced from absolute dating (see previous discussion for those estimates) 
in each simulation. I explore structural uncertainty in the averaging method by 
randomly resampling the different latitudinal zonal boundaries. I also randomly 
resample the proxy cores to explore the uncertainty introduced from a particular 
set of proxy records. The three general sources of uncertainty (from the proxy 
reconstructions, from the latitudinal zone boundaries, and from the sample of 
records) are all major contributors to the final uncertainty distribution. Random 
resampling of the SST proxy records is the largest contributor to the uncertainty. In 
total, I calculate 4,000 simulations for time series of average SST over 60° N-60° S. 
I repeat this approach with the 29 records from the clustering procedure described 
above, and find similar results for the final GAST estimate (Extended Data 
Fig. 3a, b). 
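
As a rough illustration of the zone configurations and resampling described in this section, the Python sketch below shows two things. First, the "(equal areas)" boundaries are consistent with splitting the sine of latitude evenly, since the area of a latitude band on a sphere is proportional to the difference in sin(latitude). Second, a single simulation draw can bootstrap the records, pick a zone layout at random and add proxy noise. The function names, the sample records and the proxy standard deviation of 1.5 °C (reading the stated 3 °C as two standard deviations) are assumptions, not the paper's code.

```python
import math
import random

def equal_area_boundaries(max_lat_deg, zones_per_hemisphere):
    """Interior latitudes (degrees) that split 0..max_lat into equal-area zones."""
    s_max = math.sin(math.radians(max_lat_deg))
    return [
        math.degrees(math.asin(s_max * k / zones_per_hemisphere))
        for k in range(1, zones_per_hemisphere)
    ]

def one_draw(records, configurations, proxy_sd, rng):
    """One simulation draw: bootstrap the records, pick a layout, add proxy noise."""
    sample = [rng.choice(records) for _ in records]   # resample with replacement
    layout = rng.choice(configurations)               # layouts carry equal weight
    noisy = [(lat, d_sst + rng.gauss(0.0, proxy_sd)) for lat, d_sst in sample]
    return layout, noisy

four_zone = equal_area_boundaries(60.0, 2)   # one interior boundary per hemisphere
six_zone = equal_area_boundaries(60.0, 3)    # two interior boundaries per hemisphere

# Hypothetical inputs: (latitude, change in SST) pairs and two named layouts
records = [(52.0, -4.1), (12.0, -2.0), (-8.0, -1.8), (-41.0, -3.5)]
configurations = ["four-zone equal areas", "six-zone equal areas"]
layout, noisy = one_draw(records, configurations, proxy_sd=1.5, rng=random.Random(1))
```

Rounded to one decimal place, the computed boundaries are 25.7° for four zones and 16.8° and 35.3° for six zones, matching the values listed in the configurations.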

Scaling from latitudinal SST averages to GAST. As discussed previously, this analysis 
uses the PMIP global climate models*!* to scale regional SST averages to GAST. 
This is necessary because there are insufficient land temperature proxies available 
over the past 2 Myr. I use model simulations of the LGM because they most 
closely compare to the large changes seen in the reconstructions. For a specific 
LGM climate model simulation, I estimate the change in temperature between 
the LGM and pre-industrial runs for both SST and surface air temperature. 
I then estimate the ratio of change in GAST to change in 60° N-60° S average 
SST to be used as a scaling factor (Extended Data Fig. 4). The analyses are performed 
for all available model simulations using the PMIP2 database from 30 July 
2009 (https://pmip2.lsce.ipsl.fr/database/access/request.shtml) and the PMIP3 
database from CMIP5 archives at PCMDI from 10 December 2015 
(http://cmip-pcmdi.llnl.gov/cmip5/data_portal.html). Because of the uncertainty in applying 
these scalars to estimate GAST, I more than double the standard deviation of 
the estimates from the sample of nine models (0.14) and use a scaling factor of 
1.9 (1.5-2.3, 95% interval), assuming a normal distribution that includes 8 of the 
9 models in the middle 67%. The uncertainty in the scaling factor causes approximately 
a doubling of the standard deviation of the final GAST reconstruction at 
each point in time. 

This research uses 60° N-60° S because that is the maximum extent of the SST 
reconstructions. However, the high latitude Southern Hemisphere reconstructions 
are fairly short, and for most of the past 2 Myr, the highest latitude SST 
reconstruction is at 43° S in contrast to 58° N in the Northern Hemisphere. Sensitivity 
analysis finds that repeating the analysis with the PMIP models using 50° N-50° S 
yields a very similar range of scalar factors (the mean value changes by only 1%). 
Therefore, this research continues to use the scaling factor estimate of 1.9 (1.5-2.3, 
95% interval) as equally applying to a range of 60° N-60° S or 50° N-50° S. 

To estimate GAST reconstructions, I use Monte Carlo-style simulations to 
propagate all the previously mentioned sources of uncertainty. I sample from 
the simulations of 60° N-60° S average SST described previously and apply a 
scaling factor randomly chosen from the uncertainty distribution to calculate 
5,000 simulations for the potential time series of GAST (Fig. 1a). The final GAST 
simulation ensemble of potential time series includes propagation from simulating 
the potential errors in each proxy reconstruction, from resampling the proxy 
records, and from randomly varying the spatial weighting methods and the scaling 
factors of regional average SST to GAST. When simulating GAST time series, the 
time series stops when any latitudinal zone no longer has any reconstructions 
available. Therefore, the lengths of the individual time series within the final 
simulation GAST ensemble of potential time series (Extended Data Fig. 2c) are 
reflective of the availability of SST proxy reconstructions (Extended Data Fig. 2b). 
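
The propagation step can be sketched in Python as follows. Each simulated GAST series is one randomly chosen average-SST series multiplied by one random draw of the scaling factor. The function name, the tiny sample ensemble and the standard deviation of 0.2 (roughly the half-width of the stated 1.5-2.3 interval divided by 1.96) are assumptions for illustration only.

```python
import random
import statistics

def simulate_gast(sst_ensemble, n_sims, scale_mean=1.9, scale_sd=0.2, seed=0):
    """Scale randomly chosen average-SST series by random scaling-factor draws."""
    rng = random.Random(seed)
    sims = []
    for _ in range(n_sims):
        sst_series = rng.choice(sst_ensemble)      # one simulated SST series
        factor = rng.gauss(scale_mean, scale_sd)   # one draw of the scaling factor
        sims.append([factor * x for x in sst_series])
    return sims

# Hypothetical ensemble of average-SST change series (deg C), same length each
sst_ensemble = [[-0.2, -1.5, -3.1], [-0.1, -1.8, -2.9], [-0.3, -1.4, -3.3]]
sims = simulate_gast(sst_ensemble, n_sims=1000)

# Median GAST change at each time step across the simulations
median_series = [statistics.median(col) for col in zip(*sims)]
```

This sketch assumes all series have equal length; in the paper, individual series stop where any latitudinal zone runs out of reconstructions, so a fuller version would need to handle series of different lengths.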

The assumption of linear and constant scaling over the past 2 Myr is a weakness 
in this approach, but it is necessary because of the limited available data”. One 
way to explore how that scalar value could potentially change over time is by 
investigating the PMIP simulations of the mid-Pliocene warm period (mPWP, 
3.0-3.3 Ma)". An analysis of cross-model means finds a scalar of 1.6, which is 15% 
lower than the median estimate from the LGM PMIP experiments and is within the 
proposed confidence interval. Moreover, the mPWP was much warmer (1.8-3.6°C 
warmer than present) with much less ice and sea levels higher by 22 ± 10 m (ref. 44). 
At the Pleistocene transition at ~2.7 Ma, there was a substantial increase in 
Northern Hemisphere ice sheets, and by 2.4 Ma, the climate transitioned to 
41-kyr glacial cycles until the MPT". Therefore, the Earth's climate over the past 
2 Myr was much more similar to the LGM than to the mPWP. Thus, if the scalar is 
state dependent, as the model simulations suggest, the value at 2 Ma is likely to be 
closer to 1.9 than 1.6 and thus well within the proposed interval. 

To test the impact of the assumption of a constant linear scalar of the ratio of 
change in GAST to change in average SST over the latitudinal zones 60° N to 60° S, 
I analyse two alternative methods. Rather than use the single value of 1.9 estimated 
from the LGM, I use a moving scalar that is defined by the time points of 1.9 at 
the LGM and 1.6 at the mPWP. Because the scalar is thought to potentially vary 
with climate state, I use two different time series to construct the scalar time series: 
reconstructions of deep sea oxygen isotopes” and of relative sea level°. I linearly 
scale each of these time series such that their mean value at the LGM (mean over 
19-23 ka) is 1.9 and their mean value at the mPWP (mean over 3-3.3 Ma) is 1.6. 
The resulting estimated median GAST time series are very similar to the GAST 
reconstruction estimated from a constant scalar: 0.998 correlation for the deep sea 
oxygen isotopes method and 0.998 for the relative sea level method. Investigations 
of polar amplification are also similar: 0.59 (0.45 to 0.79, 95% interval) for the deep 
sea oxygen isotopes method and 0.59 (0.45 to 0.74, 95% interval) for the relative 
sea level method, as compared to 0.61 (0.43 to 0.85, 95% interval) for the primary 
GAST reconstruction. The regressions also find that a quadratic relationship is 
not significant. The uncertainty estimates of the two alternative approaches are 
underestimates because they do not include uncertainty in the base reconstructions 
themselves nor do they include any uncertainty in the LGM and mPWP estimates 
used to scale the base reconstructions. 
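
The linear rescaling described above amounts to fitting a straight line through two anchor points: the LGM mean of the base record maps to 1.9 and the mPWP mean maps to 1.6. A minimal Python sketch, with a hypothetical function name and made-up base-record values, is:

```python
def rescale_to_scalar(series, lgm_mean, mpwp_mean, lgm_scalar=1.9, mpwp_scalar=1.6):
    """Map a climate-state record linearly onto a time-varying scaling factor.

    `lgm_mean` and `mpwp_mean` are the record's mean values over the LGM and
    mPWP windows; they become the anchors for 1.9 and 1.6 respectively.
    """
    slope = (lgm_scalar - mpwp_scalar) / (lgm_mean - mpwp_mean)
    intercept = lgm_scalar - slope * lgm_mean
    return [intercept + slope * x for x in series]

# Hypothetical base record (for example an oxygen isotope series, arbitrary units)
base = [5.0, 4.6, 4.0, 3.4, 3.0]
scalar_series = rescale_to_scalar(base, lgm_mean=5.0, mpwp_mean=3.0)
```

Values of the base record between the two anchors yield scaling factors between 1.6 and 1.9, and values outside that range extrapolate along the same line.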

Validity testing the GAST reconstruction using particular points in time. To 
test the validity of the GAST reconstruction, the new record can be compared to 
previous published reconstructions for points of interest, such as the LGM or the 
Last Interglacial. The new GAST reconstruction finds global cooling at the LGM 
(~21 ka) of 6.2°C (4.5-8.1°C, 95% interval) from the present value. This estimate is 
similar to the recent IPCC synthesis of the 'very likely' range (>95% probability) of 
3-8°C (ref. 1). The present analysis finds a higher most-likely value consistent with 
other recent proxy-based reconstructions of the LGM!™#®". Lower model-based 
estimates of LGM cooling could be influenced by underestimates of the changes in 
LGM radiative forcing, such as from changes in dust and vegetation", or 
underestimates of climate sensitivity at the LGM. The GAST estimate for maximum 
warming during the Last Interglacial (~125 ka) is 2.0°C (0.4-3.6°C, 95% interval) 
warmer than present. This result is consistent with recent proxy estimates: sea level 
rise is likely to have exceeded 8 m above present levels”, a large warming over 
Antarctica during the Last Interglacial®, and a GAST estimate of 1.9°C warming 
above pre-industrial levels*. The present estimate for maximum warming during 
the Last Interglacial is higher than some model simulations. Comparisons of the 
new GAST reconstruction with additional palaeoclimatic reconstructions find 
strong correlations, as would be expected (Extended Data Table 3). 

Validity testing the GAST reconstruction using PMIP model outputs. The PMIP 
model outputs are used to test the robustness of this research's method for 
estimating GAST. The question explored is: if the proposed approach of this paper is 
applied to grid cell values from the models for the same locations as the SST 
reconstructions, how would the estimated GAST value compare to the model's GAST 
value? For each of the nine PMIP model simulations available, nearest neighbour 
classification (via the knn1 function in the class package"? in the R statistical 
program, http://cran.r-project.org/web/packages/class/index.html) is used to identify 
the grid cell closest to each SST proxy reconstruction. The change in SST from 
present to the LGM at those locations is used as an input to the GAST estimation 
methods described above. The models' surface air temperature outputs are used to 
directly estimate GAST for each model. The final estimates of GAST using the 61 
SST proxy reconstruction locations are then compared to the models' GAST values 
(Extended Data Fig. 5). The same procedure is repeated with only the locations of 
the 5 SST reconstructions that cover the full past 2 Myr (Extended Data Fig. 5). The 
median estimates for change in GAST at the LGM using PMIP model outputs are 
very similar (−4.5°C directly from the models, versus −5.1°C from the 61 record 
locations and −4.9°C from the 5 record locations) when combined across the 
9 models, and well within the estimated 95% interval. The method of this paper 
does find a larger range of uncertainty in GAST, as is to be expected when comparing 
61 or 5 grid cells to the full surface air temperature model outputs (Extended Data 
Fig. 5). Based on these 9 models that are available, the median estimates suggest 
the new GAST reconstruction may overestimate cooling by 10%, but the median 
estimates vary in direction and magnitude across the individual models and thus 
such a conclusion is specific to the set of model outputs. 
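
The grid-cell matching step can be illustrated with a simple nearest-neighbour lookup in Python. This is only a sketch of the idea, not the knn1 routine from the R class package: the squared latitude/longitude distance with longitude wrap-around, the function name and the sample coordinates are assumptions.

```python
def nearest_cell(site, cells):
    """Index of the grid cell centre closest to `site`; all points are (lat, lon) in degrees."""
    def dist2(a, b):
        dlat = a[0] - b[0]
        dlon = abs(a[1] - b[1]) % 360.0
        dlon = min(dlon, 360.0 - dlon)   # shortest way around in longitude
        return dlat * dlat + dlon * dlon
    return min(range(len(cells)), key=lambda i: dist2(site, cells[i]))

# Hypothetical model grid cell centres and one proxy site
cells = [(0.0, 0.0), (0.0, 180.0), (45.0, -170.0)]
site = (44.0, 175.0)
idx = nearest_cell(site, cells)
```

A distance in degrees ignores the convergence of meridians at high latitudes, so a great-circle distance could be substituted where that matters.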

Robustness of the GAST reconstruction to particular sample of cores. Early in 
the reconstruction the estimate is based on 61 reconstructions, but at 400 ka on 
only 14 reconstructions, at 800 ka on only 8 reconstructions, and at 2 Ma on only 
5 reconstructions (Extended Data Fig. 2). This research uses three different analyses 
to assess the robustness of the final GAST reconstruction to the particular set of 
SST reconstructions currently available. First, the final GAST ensemble includes 
bootstrap Monte Carlo-style simulations that resample the reconstructions 
before following the methods to calculate GAST. The final ensemble of GAST 
time series are directly from this bootstrap simulation, and the overall uncertainty 
thus includes the uncertainty caused by the particular set of 61 reconstructions. 
Second, the entire methodology, including the bootstrap simulations, is repeated 
for just the 11 clusters plus 18 individual records, as described above. The entire 
methodology, including the bootstrap simulations, is also repeated for just the 
5 SST reconstructions that cover the full past 2 Myr. Extended Data Fig. 3a, b 
compares the median estimates from those two variations to the primary GAST 
reconstruction. Although the reduction in number of reconstructions causes larger 
uncertainty and potential different variance structures, the median estimates are 
very similar (0.997 correlation for the clustered version and 0.953 correlation for 
the 5-record version). Third, as described previously, this analysis’ methodology 
is applied to the PMIP model outputs for both the full 61 reconstructions and just 
the 5 records that cover the full past 2 Myr and finds GAST estimates consistent 
with the models' air surface temperature outputs (Extended Data Fig. ). 
DWT reconstructions. This analysis compares GAST to 12 different proxy-based 
reconstructions of changes in DWT developed from three different methods 
(Extended Data Table 2). Elderfield et al.‘ estimate DWT using Mg/Ca ratios 
from bottom-dwelling foraminifera. Rohling et al.’ estimate sea level using surface 
planktonic oxygen isotopes from Mediterranean Sea sediments and then use the 
sea level estimate to remove the ice-volume effect from the global benthic oxygen 
stable isotope data to estimate global DWT changes. Bates et al. estimate DWT 
from oxygen isotope records from benthic foraminifera shells using regression 
analysis for ten different deep-ocean records, using a much simpler model for the 
relationship between benthic oxygen isotopes and DWT as well as older age models 
than the other two reconstructions. 
GHG radiative forcing. For GHG concentrations from the past 800 kyr, I use 
stacked reconstructions from Antarctic ice cores"*'**. The same 1-kyr interpolation 
method is applied to these records as described above for the SST 
reconstructions. I calculate the radiative forcing changes for CO2 and methane 
from the past 800 kyr using the equations from Köhler et al.” and Hansen 
et al." of total forcing = α(CO2 forcing + β × methane forcing). I apply highly 
conservative 95% intervals for the parameters in the proposed approximations: 
α (approximation for N2O) is 1.12 (1.0-1.24, 95% interval), β (efficacy of methane) 
is 1.4 (1.0-1.8, 95% interval), and I apply conservative uncertainty of 20% (95% 
interval) for the overall equation. 
Regression analyses. To investigate the relationship between the new GAST 
reconstruction and other palaeoclimatic reconstructions (for example, Extended Data 
Table 3 and lines in Fig. 2), I estimate 'GAST sensitivity' (the estimated linear 
relationship of change in GAST for each unit of change in the palaeoclimate record) 
and the correlation between the two reconstructions. I first randomly sample a 
single time series from the GAST simulation ensemble of potential time series 
(described above) and a single time series from the simulation ensemble for the 
comparison record. I then normalize each record to be deviations from present, 
where present is defined as the mean value over 1-5 ka. It is necessary to use 
5 ka because not all reconstructions have estimates for 1-3 ka. I use weighted least 
squares regressions without an intercept, because both records are deviations from 
present. I test both linear and nonlinear (such as quadratic) relationships, 
and I use an ANOVA test to assess the improved fit of alternative functional 
relationships. I also quantify the correlation of the two time series. I then repeat 
the analyses for at least 500 random draws of time series from each reconstruction. 
Because there is high autocorrelation in most palaeoclimatic reconstructions, I 
include an autoregressive model to evaluate the potential underestimation of error 
in the regression coefficients that can be caused by autocorrelation’. For example, 
the autoregressive model for GAST as a function of radiative forcing from GHGs 
does not significantly change the regression coefficient, as predicted by theory, 
but it does increase the estimated standard error of the regression coefficient from 
1.8% to 3.5% of the median value. However, the regression standard error is an 
insignificant contribution to the overall uncertainty analysis, which estimates an 
analogous standard error of 19% overall. To assess whether the regression results 
vary for deep glacial states, I define deep glacial periods as the bottom 25% of the 
comparison record (less than −2 W m−2 for GHG radiative forcing) and repeat 
the analyses with the two sets of time periods separately. I convert the estimated 
ratio of change in GAST to change in GHG radiative forcing into ESS by multiplying by 
3.7 W m−2 (the change in radiative forcing from a doubling of CO2). 
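
The regression without an intercept and the conversion to ESS can be sketched in Python as below. For a weighted least squares fit through the origin, the slope is the weighted sum of x·y divided by the weighted sum of x². The function name, the weights and the data values are hypothetical and are not results from the paper.

```python
def wls_slope_through_origin(x, y, w):
    """Weighted least squares slope for y = slope * x (no intercept)."""
    num = sum(wi * xi * yi for xi, yi, wi in zip(x, y, w))
    den = sum(wi * xi * xi for xi, wi in zip(x, w))
    return num / den

F_2XCO2 = 3.7  # W m^-2 per doubling of CO2, the value stated in the text

# Illustrative values only: change in GHG forcing (W m^-2) and change in GAST (deg C)
forcing = [-1.0, -2.0, -3.0]
gast = [-1.5, -3.0, -4.5]
weights = [1.0, 1.0, 2.0]

sensitivity = wls_slope_through_origin(forcing, gast, weights)  # deg C per W m^-2
ess = sensitivity * F_2XCO2                                     # deg C per CO2 doubling
```

In the paper the fit is repeated over many random draws from both simulation ensembles, so the reported sensitivity is a distribution of slopes rather than a single value.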
Investigations of ESS are limited by the availability of GHG reconstructions. 
Because reconstructions of atmospheric CO2 before 800 ka are highly uncertain 
and limited in temporal resolution and reconstructions of methane before 800 ka 
do not exist, this research focuses on analysis of the past 800 kyr of GHG 
reconstructions. However, I analyse the reconstructions of CO2 based on boron 
isotopes that are available for limited time points across the past 2 Myr. I use 
the equation from Köhler et al.” to estimate CO2 radiative forcing. I repeat 
the regression analyses described above for the limited set of time points to estimate 
GAST sensitivity to CO2 radiative forcing and I repeat the analysis separately for 
0-1 Ma and 1-2 Ma (Extended Data Table 3 and Extended Data Fig. 7). For the 
past 1 Myr, the results are substantially similar to the results obtained from a 
comparison with CO2 reconstructed from ice cores, with lower correlation and 
coefficient estimates as would be expected due to the higher uncertainty in the 
CO2 reconstruction. Prior to 1 Ma, CO2 radiative forcing from boron isotopes is 
poorly correlated with GAST changes, suggesting either a decoupling of GAST 
and CO2 before the MPT, or more likely, errors in the CO2 reconstruction, GAST 
reconstruction, and/or the relative dating of the records. 

Probabilistic breakpoint identification. This research uses probabilistic 
breakpoint identification to estimate changes in GAST time trends over the past 
2 Myr. Breakpoint simulation detects and identifies changes within time series 
by decomposing the time series into linear trends and breakpoints. This research 
uses the bfast function from the bfast package" in the R statistical program 
(http://cran.r-project.org/web/packages/bfast/index.html), which iteratively estimates 
time trends and breakpoints through a piecewise linear trend to identify optimal 
values. The bfast simulation program is applied independently to 500 randomly 
selected time series from the final ensemble of GAST time series to estimate the 
empirically fitted frequency distributions in Fig. 3. The program also estimates the 
time trend before and after the breakpoint. No smoothing is used in this analysis. 
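
To make the idea of a piecewise linear trend with one breakpoint concrete, the Python sketch below does a brute-force search: it tries each candidate split, fits a separate straight line to each side, and keeps the split with the smallest total squared error. It is a simplified stand-in and does not reproduce the bfast algorithm; the function names, the minimum segment length and the synthetic series are assumptions.

```python
def fit_line(t, y):
    """Ordinary least squares line; returns (slope, intercept, sum of squared errors)."""
    n = len(t)
    mt, my = sum(t) / n, sum(y) / n
    stt = sum((ti - mt) ** 2 for ti in t)
    slope = sum((ti - mt) * (yi - my) for ti, yi in zip(t, y)) / stt
    intercept = my - slope * mt
    sse = sum((yi - (intercept + slope * ti)) ** 2 for ti, yi in zip(t, y))
    return slope, intercept, sse

def best_single_break(t, y, min_seg=3):
    """Return (break time, slope before, slope after) minimising total squared error."""
    best = None
    for k in range(min_seg, len(t) - min_seg + 1):
        s1, _, e1 = fit_line(t[:k], y[:k])
        s2, _, e2 = fit_line(t[k:], y[k:])
        if best is None or e1 + e2 < best[0]:
            best = (e1 + e2, t[k], s1, s2)
    return best[1:]

# Synthetic series: a falling trend, then a jump to a rising trend from t = 10 onward
t = [float(i) for i in range(20)]
y = [-ti if ti < 10 else -20.0 + 0.5 * (ti - 10.0) for ti in t]
break_time, slope_before, slope_after = best_single_break(t, y)
```

Applying such a search to many randomly drawn series and collecting the break times and slopes would give frequency distributions analogous in spirit to those shown in Fig. 3.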
Data and code availability. Supplementary Methods includes R code for key 
methods described in the paper. Supplementary Data includes the new GAST 
reconstruction at 2.5%, 5%, 25%, 50%, 75%, 95% and 97.5% likelihood values, and 
the 61 SST reconstructions used to create the GAST reconstruction, including a 
detailed summary table. 
Extended Data Figure 1 | Spatial distribution of the SST proxy reconstructions used in this analysis. a, All 61 SST records, with methods as follows: 
from alkenone indices, blue circles; from Mg/Ca ratios, red triangles; and from species assemblage methods, brown squares. b, Repeated after clustering 
records within 5° latitude/longitude of each other, with the 11 clusters in cyan diamonds and the remaining 18 records as in a. 
Extended Data Figure 2 | Temporal distribution of the 61 SST proxy 
reconstructions used in this analysis. a, Reconstruction length versus 
latitude, colours as in Extended Data Fig. 1. b, Empirical cumulative 
distribution function for lengths of the SST proxy reconstructions. 
c, Empirical cumulative distribution function for lengths of GAST time 
series in the final simulation ensemble of potential GAST time series. 
Extended Data Figure 3 | Comparison of different methods used 
to estimate GAST. a, The primary GAST estimate (using 61 proxy 
reconstructions) is plotted as a function of time, with the median in black 
and the 95% interval in grey. The GAST estimation method is repeated 
for a clustering of the data (11 clusters and 18 individual reconstructions), 
with the median shown in cyan, and for only the 5 proxy reconstructions 
that cover the past 2 Myr, with the median shown in orange. b, The median 
time series from each alternative method are plotted against the primary 
median GAST estimate, with the clustered version in cyan circles and 
the 5-record version in orange squares. c, The primary GAST estimate 
is plotted as a function of time, with the median in black and the 95% 
interval in grey. An alternative GAST estimation method using a time- 
varying scalar based on the deep-sea oxygen isotopes median estimate 
is shown in green, and another estimation method based on the relative 
sea level median estimate is shown in purple. d, The median time series 
from each alternative method is plotted against the primary median GAST 
estimate, with the reconstruction scaled using deep sea oxygen isotopes 
shown in green circles and the reconstruction scaled using relative sea 
level shown in purple squares. 
Extended Data Figure 4 | Estimates of the ratio of change in GAST to change in average SST. a, b, Scatter plots show the dependency of the ratio of change in GAST to change in average SST over the latitudinal zone 60° N to 60° S from PMIP2 and PMIP3 climate model simulations as a function of change in GAST at the LGM (a) and of model climate sensitivity (b). The climate sensitivity estimates (in °C per W m⁻²) are from ref. 33. Dashed lines show the scalar range used in this analysis. 
Extended Data Figure 5 | Estimating change in GAST at the LGM using 
simulations drawn from PMIP model outputs. The solid, purple line 
is the empirically fitted frequency distribution (shown in density on the 
y axis) of GAST estimated from the full air surface temperature outputs 
from the 9 PMIP models. The dashed, black line is the distribution of 
GAST estimated using the method in the present paper and the PMIP SST 
outputs drawn from only the locations of the 61 proxy reconstructions. 
The short-dashed, orange line is the same analysis completed for only the 
5 proxy reconstructions that cover the past 2 Myr. The thin vertical lines 
are the medians of each distribution. 
Extended Data Figure 6 | The dependence of coupling relationships 
over time for GAST on changes in Antarctic temperature and GHG 
radiative forcing. a, b, Regression results of change in GAST as a function of change in Antarctic temperature (a) and of change in GHG radiative forcing (b) are calculated for moving 200-kyr-long time windows every 5 kyr. The solid line shows the median estimates, with the coloured and grey-shaded areas showing the 50% and 95% intervals, respectively. The dashed lines show the 95% intervals calculated from the entire time series. 
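
The moving-window regression described in this caption can be illustrated with a minimal sketch. It assumes 200-kyr windows advanced every 5 kyr and uses an ordinary least-squares slope on a single synthetic series; the function names and the synthetic data are hypothetical, and the uncertainty treatment behind the published median and interval bands is not reproduced here.

```python
import math

def ols_slope(x, y):
    """Ordinary least-squares slope of y on x (NaN if x has no variance)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy / sxx if sxx > 0 else float("nan")

def moving_window_slopes(age, x, y, width=200.0, step=5.0):
    """Regress y on x inside windows of `width` kyr, advanced every `step` kyr.

    Returns a list of (window-centre age, slope) pairs.
    """
    results = []
    first, last = min(age), max(age)
    k = 0
    while first + k * step + width <= last:
        lo = first + k * step
        hi = lo + width
        idx = [i for i, a in enumerate(age) if lo <= a <= hi]
        if len(idx) >= 3:  # need a few points for a meaningful slope
            slope = ols_slope([x[i] for i in idx], [y[i] for i in idx])
            results.append((lo + width / 2.0, slope))
        k += 1
    return results

# Synthetic illustration only: a "forcing" series and a temperature series
# that depends on it linearly with a slope of 2.
age = [float(a) for a in range(0, 1001)]        # kyr
forcing = [math.sin(a / 30.0) for a in age]
gast = [2.0 * f + 1.0 for f in forcing]
slopes = moving_window_slopes(age, forcing, gast)
```

Each returned slope plays the role of a window-specific sensitivity; in practice one would repeat the calculation over an ensemble of time series to obtain medians and intervals.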
Extended Data Figure 7 | Comparison of changes in GAST to changes in 
CO₂ radiative forcing. Boron-isotope-based proxy reconstruction of CO₂ 
from refs 17, 56. Blue points are from 0–1 Ma, red points are from 1–2 Ma, 
and error bars show 95% intervals. 
Extended Data Table 1 | Database of SST proxy reconstructions based on Mg/Ca ratio and species assemblages used in estimating GAST 


Citation  Latitude (+N/−S)  Longitude (+E/−W)  Start Date (yr BP)  End Date (yr BP) 
Mg/Ca Ratio SST Proxy Reconstructions 
de Garidel-Thoran et al. 2005 2 142 7,000 1,788,000 
Lea 2004” 2 1 4,000 361,000 
Lea et al. 2006 0 92 1,200 195,100 
Mashiotta et al, 1999 °7 -43 80 2,700 293,700 
Mashiotta et al. 1999 °7 56 115 8.110 108,450 
Medina-Elizalde and Lea 2005 "* ° 159 4,300 1,348,000 
Mohtadi et al. 2011 % 6 103 o 131,400 
Namberg et al. 2000 2 12 240 274,830 
Nurnberg et al. 2000 6 eT 1,000 271,000 
Oppo and Sun 2005 20 118 41,943 142,571 
Pahnke et al. 2003 -45 175 1,951 340,835, 
Rickaby and Elderfield 1999 * -40 85 4,750 196,710 
Rickaby and Elderfield 1999 44 90 3,800 283,670 
Russon et al. 2010 -23 166 o 2,438,000 
Saraswat et al. 2005 * 3 78 5.434 197,333 
Wei et al. 2007 ® 20 "7 1,740 261,300 
Weldeab et al. 2007 ” 2 8 360 155,420 
Species Assemblages SST Proxy Reconstructions 
Barrows et al. 2007 42 170 3,830 142,712 
Barrows et al. 2007" -46 175 5,401 133,995, 
Brathauer and Abelmann 1999” -43 12 387 338,519 
Kandiano et al. 2004” 54 -20 2,975 193,085 
Labeyrie et al. 1996; Rickaby and Elderfield 1999 -46 96 4,590 148,250 
Martinson et al. 1987 -44 80 750 284,000 
Pisias and Mix 1997 ™* 0 -96 360 752,047 
Pisias and Mix 1997 a 83 1,914 151,051 
Pisias and Mix 1997 * ° “110 880 144,763 
Pisias and Mix 1997 7* 0 -86 4,479 184,653 
Pisias and Mix 1997 16 “78 785 466,520 
Weaver et al, 1998 ° 44 “172 a 170,000 
Weaver et al. 1998 * -46 172 ° 116,705 
Weaver et al, 1998 * “45 178 170 120,880 
Weaver et al. 1998 ° -43 178 a 154,600 
Weaver et al. 1998 60 23 0 120,000 


Data are taken from refs 27, 29, 42, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75 and 76. 
Extended Data Table 2 | Database of SST proxy reconstructions based on alkenone indices used in estimating GAST 


Citation  Latitude (+N/−S)  Longitude (+E/−W)  Start Date (yr BP)  End Date (yr BP) 
Alkenone Indices SST Proxy Reconstructions 
Bard 2002” 38 10 0 110,100 
Bard et al. 19977* 0 46 0 151,500 
Bard et al. 19977* 3 50 0 149,200 
Clemens et al. 2008 ”; Herbert et al. 2008 19 116 4178 2,185,910 
Dubois et al. 2011" 2 “31 4,070 99,290 
Eglinton et al. 1992 a 18 736 653,042 
Herbert et al, 2010 2 42 1,280 445,000 
Herbert et al. 2010 v7 60 7.110 3,390,480 
Herbert et al. 2010 3 31 5,550 5,069,892 
Horikawa et al, 2006 * a “95 4,100 154,097 
Lawrence et al, 2008 * a “31 5,230 5,089,802 
Lawrence et al. 2009 58 a7 ° 4,012,230 
Martinez-Garcia et al. 2010 * -43 9 0 3,642,410 
Muller et al. 1997" -20 9 3,300 402,900 
Pahnke and Sachs 2006 *” 46 175 4,950 156,360 
Pahnke and Sachs 2006” -40 178 3,320 195,050 
Pelejero et al, 1999 * 15 m 0 221,100 
Pelejera et al. 1999 “* 8 112 o 142,290 
Pelejero et al. 2006 -42 170 3,570 288,540 
Pelejero et al. 2006 “44 150 5,045 459,632 
Rostek et al. 1997 14 53 2,300 152,600 
Pahnke and Sach 206°”; Sachs and Anderson 2003 *" “41 9 6,080 160,000 
Schneider et al, 1995" 6 10 400 189,000 
Schneider ot al. 1995 12 " 1,300 200,600 
Sicre et al. 2000 25 16 5,000 144,000 
Villanueva et al. 1998 43 -30 2,900 285,900 
Yamamoto et al. 2004, 2007 **** 32 119 870 157,048 
Yamamoto et al. 2004, 2007 °° 34 “122 3,186 196,475 
Zhao et al, 2006” 9 110 905 148,886 


Data are taken from refs 29, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96 and 97. 
Extended Data Table 3 | Comparisons of GAST with other important palaeoclimate reconstructions 


Record Compared to GAST  GAST Sensitivity  Correlation 
Median (95% Interval)  Median (95% Interval) 

Total Greenhouse Gas Radiative Forcing 25 “OWI (78,36) 082 (0.66, 0.92) 
CO, Radiative Forcing from ice cores'?"* 3.0°Cnwim? (2.1.45) 0.82 (0.63, 0.92) 
CO, Radiative Forcing trom Boror-isotopes, 0-1Ma"’* 2.4°cnwie? (18,34) 0.74 (0.82, 0.84) 
CO, Radiative Forcing trom Boron-isotopes, 1-2Ma'”* 1.3 °Cwim? (0.87, 22) 0.28 (0.10, 0.55) 
‘Antarctic Temperature stack" oe crc (0.49, 0.85) 072 (0.59,0.81) 
Greenland Temperature-GRIP™ ost cre (0.23, 0.43) 0.89 (0.83, 0.92) 
Greenland Temperature-NGRIP” 0.27 °cr°c (0.19, 0.35) 0.91 (0.87, 0.93) 
Deep-Sea Oxygen Isotope” 42 °OMe (56,30) 0.85 (0.94, -0.58) 
Deep-Water Temperature from Southwest Pacific (Core 1123)* 22°cre (15,32) 064 (0.35, 0.78) 
Deep-Water Temperature from Mediterranean Sea 1.7 °0rC (0.8, 2.9) 047 (0.14, 0.64) 
Deep-Water Temperature from Temperate North Atlantic (SDP 

1.5 °CrC (1.0,2.2) 0.48 (0.36, 0.64) 
607)° 
Deep-Water Temperature from East Tropical Atlantic (ODP 658)° 1.5 °0rC (1.0, 2.1) 0.84 (0.71,0.90) 
Deop-Water Temperature from Indian Ocean (ODP 758)* 32°0re (2.0, 4.8) 027 (0.14, 0.42) 
Deep-Water Temperature from Equatorial Eastern Pacific (ODP 

3.0*ore (2.1,4.2) 0.85 (0.78, 0.89) 
849) 
Deep-Water Temperature from Sub-Arctic North Atlantic (ODP 

1.4°0rc (1.0,1.9) 075 (0.40, 0.85) 
980, ODP 981)° 
Deep-Water Temperature from Sub-Antarctic South Atlantic 

2a°cre (1.4,3.0) om (061,082) 
(ODP 1090)" 
Deep-Water Temperature from South China Sea (ODP 1143)° 22°0r0 (15,31) 0.79 (0.64, 0.85) 
Deep-Water Temperature from South China Sea (ODP 1148)° 3.5°Cr"C (2.4, 5.0) 0.54 (0.42,0.64) 
Deep-Water Temperature from Equatorial Eastern Pacific 

24*cre (16,33) 0.60 (0.37, 0.69) 
composite (V19-30, ODP 677, ODP 846)" 
Deep-Water Temperature from Southwest Pacitic (Core 1123)° 1.9°0rc (13,26) ot (0.58, 0.93) 


'GAST sensitivity' is the estimated linear relationship of change in GAST for each unit of change in the palaeoclimate record. Data are taken from refs 4, 5, 6, 14, 17, 18, 20, 54, 56, 98 and 99. 
Progressive incision of the Channeled Scablands by outburst floods 

Isaac J. Larsen & Michael P. Lamb 


The surfaces of Earth and Mars contain large bedrock canyons that 
were carved by catastrophic outburst floods'*. Reconstructing 
the magnitude of these canyon-forming floods is essential for 
understanding the ways in which floods modify planetary surfaces, 
the hydrology of early Mars* and abrupt changes in climate’. Flood 
discharges are often estimated by assuming that the floods filled 
the canyons to their brims with water; however, an alternative 
hypothesis is that canyon morphology adjusts during incision such 
that bed shear stresses exceed the threshold for erosion by a small 
amount®. Here we show that accounting for erosion thresholds 
during canyon incision results in near-constant discharges that are 
five- to ten-fold smaller than full-to-the-brim estimates for Moses 
Coulee, a canyon in the Channeled Scablands, which was carved 
during the Pleistocene by the catastrophic Missoula floods in 
eastern Washington, USA. The predicted discharges are consistent 
with flow-depth indicators from gravel bars within the canyon. 
In contrast, under the assumption that floods filled canyons to 
their brims, a large and monotonic increase in flood discharge is 
predicted as the canyon was progressively incised, which is at odds 
with the discharges expected for floods originating from glacial lake 
outbursts. These findings suggest that flood-carved landscapes in 
fractured rock might evolve to a threshold state for bedrock erosion, 
thus implying much lower flood discharges than previously thought. 

Field investigation of the Channeled Scablands, which contains vast 
tracts of scoured bedrock, giant gravel bars and deep canyons, led to 
the eventual recognition that short-lived, catastrophic floods, rather 
than uniformitarian processes, were the dominant agents of canyon 
formation in the region. Outburst floods caused by the sudden 
drainage of lakes due to the failure of glacial or other natural dams 
have been documented in many settings on Earth’; and enormous 
bedrock canyons, morphologically similar to those in the Channeled 
Scablands, provide evidence for widespread outburst flooding on the 
surface of Mars*. Canyons carved into planetary surfaces are hence a 
key record of the palaeo-hydrosphere’, and constraining the magni- 
tude of the canyon-carving floods is required to understand whether 
megafloods—immense, short-lived, catastrophic floods—have the 
potential to trigger climate change, by altering ocean circulation on 
Earth or by episodically generating oceans on Mars. 

Although bedrock canyons and associated depositional landforms 
provide direct evidence for palaeo-floods, reconstructing the hydraulics 
of outburst floods is challenging. Outburst floods often initiate sub- 
glacially” or, in the case of Mars, from subsurface aquifers"; hence, 
canyon geometry and the grain size of flood-transported sediment 
often provide the only clues from which water depth, bed shear 
stress and discharge can be reconstructed. Outburst floods in the 
Channeled Scablands and on Mars have traditionally been assumed 
to fill the canyons up to the elevation of high-water marks such as 
eroded channel margins" along the rims of the canyons. Consequently, 
most hydraulic modelling efforts implicitly assume that the canyons 
were filled with water to their brims!" or attempt to match modelled 


flood depths to the brim-full markers'*'*"°. However, the actual flow 
depth is unknown!” because canyon floors must have progressively 
lowered as a result of bedrock incision, and high-water markers may 
have been active only early in canyon formation; therefore, the dis- 
charges predicted from ‘brim-full’ models are reported as maximum 
estimates! "3, 

In the case of other bedrock canyons that formed over millions of 
years, such as the Grand Canyon, USA, water flows that were responsi- 
ble for carving the canyon had depths that were always a small fraction 
of the canyon relief. Recent mechanistic studies?" and theory for 
bedrock erosion by plucking of blocks from the bed” or toppling at 
waterfalls? indicate that, where rock is fractured, bedrock channels 
will incise via the entrainment of bedrock blocks from the bed when 
fluid stresses exceed the threshold for entrainment. This ‘threshold 
shear stress’ model implies that flows in outburst-flood-carved 
canyons will drop below brim-full as canyons deepen. Resolving 
whether canyons carved by outburst floods should be interpreted 
as channels filled to the brim or as valleys with flow only near their 
bottoms is key for reconstructing palaeo-flood discharge. However, 
evaluating these two ‘end-member’ models is challenging; both require 
removal of rock, but information on how canyons evolve during 
formation is lacking. 

We conduct a quantitative evaluation of the end-member brim-full 
and threshold shear stress models by numerically simulating floods in 
Moses Coulee, a 70-km-long canyon in the Channeled Scablands (Fig. 1) 
with evidence of flooding 175 m above the canyon floor and a history 
of at least four late-Pleistocene-epoch outburst floods. We selected 
Moses Coulee for the study because, unlike other canyons in the 
Channeled Scablands, the planform geometry is relatively simple and 
a set of bedrock terraces enables reconstruction of the palaeo-channel 
bed during canyon incision. Plucking is the dominant erosion mecha- 
nism in the well-jointed basalt bedrock of the Channeled Scablands*; 
however, forward-modelling the co-evolution of hydraulics and bed 
topography in a megaflood channel that is eroding by plucking is cur- 
rently intractable, owing to computational expense and the fact that 
the physics of wall and bed erosion are not well known. Instead, we 
conducted simulations for the modern Moses Coulee topography 
and explored how flooding evolved in concert with canyon incision 
by routeing floods through four inferred stages of canyon formation 
(Fig. 1c), for which we reconstructed palaeo-topography guided by 
bedrock terraces (see Methods). Floods were simulated in discharge 
increments of 0.1 × 10⁶ m³ s⁻¹ using a two-dimensional, depth-averaged, 
hydraulic model. The elevations of high-water marks 
identified in the field were used to constrain the discharges predicted 
by the brim-full model. The discharges predicted by the threshold shear 
stress model were determined from simulations in which bed stresses 
only slightly exceeded the threshold for incision via bedrock-block 
plucking, with lower-bound estimates based on the assumption 
of cohesionless blocks and upper-bound estimates that include 
interlocking via the inclusion of block wall friction. 
Figure 1 | The Moses Coulee study site. a, Moses Coulee in eastern Washington, USA (location indicated by the smaller boxed region in the inset). Blue areas were inundated by late-Pleistocene floods and the white line delineates the southern extent of the Cordilleran ice sheet. Flood and ice extent are interpreted from version 3.0 of the Washington Department of Natural Resources digital 1:250,000 scale state geological map. b, Moses Coulee with modelled bed shear stresses (see colour scale in d) during a flood with a discharge of 0.6 × 10⁶ m³ s⁻¹; the model domain is outlined in green. c, Long profile of Moses Coulee showing the modern bed topography (solid black line), mapped bedrock terraces (grey circles) and the bed topography reconstructed from the terrace elevations (grey lines labelled A–D). The rim of the canyon is indicated by the dashed black line. Elevation is with respect to sea level. d, The study reach. The white circle shows the location of the field-defined high-water mark; the blue line shows the high-water line interpreted from aerial imagery. The line A–A′ shows the location of the cross-section in Fig. 4. 

The threshold shear stress model for bedrock canyons stems from theoretical and field evidence from gravel-bedded rivers that demonstrates that, on average, bed shear stresses can exceed the threshold for sediment transport by only 20% before the channel cross-section will erode and evolve to a form that maintains shear stresses near the threshold for sediment transport. Hence, well-jointed rock that is subject to megaflooding may behave more like a bed of sediment with limited or no cohesion in which individual blocks can be easily entrained and transported, rather than as massive crystalline rock that erodes slowly through abrasion. Such bedrock channels cannot withstand large discharges or shear stresses that greatly exceed the threshold for plucking. Instead, the channels will adjust their morphology through bedrock erosion so that bed stresses, on average, only slightly exceed the threshold for block erosion. 

The discharges predicted by the two models are tested by assessing the temporal evolution of discharge during canyon incision, because the brim-full model implies a temporal increase in discharge to maintain brim-full flow while the canyon floor erodes. However, in the Channeled Scablands, at least tens of floods are thought to have occurred, and all but the most recent floods probably had comparable discharges, owing to a triggering mechanism that required filling an ice-dammed lake to a threshold level sufficient to float the dam and release an outburst. A second test uses the depositional bars within Moses Coulee, because these require certain levels of inundation and sediment transport regimes to form, by determining which model predictions are most consistent with the location and morphology of two key landforms: the Great Gravel Bar (Fig. 2a) and a boulder bar on an 'abandoned channel', a terrace 70 m above the canyon floor (Fig. 2b, c). 

Figure 2 | Depositional landforms in Moses Coulee. a, The Great Gravel Bar in upper Moses Coulee overlain by a water-depth map for a simulated flood with a discharge of 0.6 × 10⁶ m³ s⁻¹. b, Aerial photograph showing the location of the abandoned channel boulder bar (outlined in white). Aerial imagery courtesy of the US Department of Agriculture. c, Field photo of the boulder deposit shown in b; the median grain size of clasts on the bar is 0.15 m. The photo is taken near the tip of the arrow in b and the view is downstream; person (circled) for scale. 

The discharge predicted by the brim-full model increases from 
0.1 × 10⁶ m³ s⁻¹ to a peak of 3.0 × 10⁶ m³ s⁻¹ (Fig. 3a) during canyon 
formation; the peak discharge value during the final stage of canyon 
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Figure 3 | Simulated discharge and water depths. a, b, Discharge (a) and 
water depths (b) predicted by the brim-full model, by the threshold shear 
stress model, and from criteria related to the initial motion of sediment. 
The shading shows the range of predicted values based on upper- and 
lower-bound parameterizations of the critical dimensionless shear stress 
for bedrock incision by block sliding (see Methods). The letters A–D 
indicate simulation results for bed elevations A–D; M denotes results for 
the modern topography. Relative thalweg elevations denote the elevation 
of the palaeo-channel floor above the modern channel floor. 


formation is of similar order to previous estimates for other Channeled Scablands canyons. By contrast, the discharges predicted by the threshold shear stress model are lower by approximately an order of magnitude, with incision from the uppermost terrace to the canyon floor achieved by discharges of at most 0.6 × 10⁶ m³ s⁻¹ (Fig. 3a), and the predicted discharges change only slightly during canyon formation. Water depths predicted by the threshold shear stress model are 27–76 m, whereas the brim-full hypothesis predicts increasing water depths during canyon formation with a peak at 175 m (Fig. 3b). The narrow range of discharge rates predicted by the threshold shear stress model is consistent with the incision occurring during a single flood or during multiple floods of similar magnitude, as would be expected for floods issuing repeatedly from the same ice-dammed lake. 

The Great Gravel Bar in upper Moses Coulee is inundated at a modelled discharge of 0.6 × 10⁶ m³ s⁻¹ (Fig. 2a), which is consistent with the upper discharge estimate predicted by the threshold shear stress model and with observations that indicate that bars tend to aggrade to near the height of the flood. The bar at the abandoned channel site is made up of well-rounded basalt clasts with a median diameter of 0.15 m, indicating that they were transported as bedload. Bar deposition by bedload is consistent both with discharges predicted by the threshold shear stress model (because modelled bed shear stresses at the bar just exceed the threshold of motion) and with the formation regime of bars in rivers with coarse-grained bed sediment. Shear stresses predicted by the brim-full model would have exceeded the threshold of suspension for the majority of the boulders on the bar (which have diameters of less than 0.5 m), which is inconsistent with the field evidence for bedload deposition. The morphology and sedimentology of the depositional bars are therefore more consistent with the threshold shear stress end-member than with the brim-full end-member. 

Bed stresses on some parts of the knickzone in the central part of Moses Coulee are several-fold higher than the thresholds for plucking, for discharges that are consistent with the threshold shear stress model (Fig. 1b, d). If flood waters no longer reached the height of the terraces because they were abandoned by lowering of the channel bed as the knickzone retreated upstream, as is probably the case, then the higher shear stresses on the steep channel bed indicate that the knickzone was probably a site of transient, rapid erosion, provided that the well-jointed Columbia River basalts are indeed mechanically similar to cohesionless or weakly interlocked bedrock blocks. 

Figure 4 | Co-evolution of topography and flood level predicted by the brim-full and threshold shear stress models. Qi, Qus and Qua refer to discharges predicted by brim-full, threshold shear stress and sediment initial motion criteria, respectively, with units of 10⁶ m³ s⁻¹. The location of the cross-section (A–A′) is shown in Fig. 1d. 

The threshold shear stress model implies that canyons in the Channeled Scablands were eroded by floods with depths that were a fraction of the relief of the final canyon (Fig. 4). This physics-based finding is consistent with several recent investigations of canyon carving at other sites on Earth and Mars: for example, those where bedrock incision by plucking or toppling of jointed rock occurs at depths less than brim-full, those where terrace chronology indicates multiple episodes of canyon incision, or those where lakes in breached craters contain insufficient water volumes to fill downstream channels. 

Our results suggest that the morphology of canyons (for example, terraces, valley shapes and slope profiles) on Earth and Mars could reveal information about both the history and discharge of flooding that warrants further investigation. The outburst floods that carved the Channeled Scablands were extraordinary under either end-member model, but predictions of discharges from the threshold shear stress model are five- to ten-fold smaller. On Mars, owing to the low permeability of aquifers, it has been challenging to reconcile the very large reconstructed brim-full discharges in outflow channels with a subsurface flood source. Given the proposed similarity in incision mechanics for outflow channels on Mars and in the Channeled Scablands, the threshold shear stress model provides a link between the physics of groundwater-sourced floods and terraces observed in orbital data, implying longer duration, lower discharge floods, or multiple floods on early Mars. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 

Hydraulic modelling. Outburst floods through Moses Coulee were simulated using ANUGA version 2.0, a finite-volume hydrodynamic model that conserves mass and momentum by solving the two-dimensional, time-dependent, depth-averaged, shallow-water equations on a triangular mesh. The size of the triangular mesh was varied spatially, with a maximum triangle area of 900 m² for the main study reach and 5,000 m² elsewhere (Extended Data Fig. 1). 

Floods were simulated with five different topographic boundary conditions: the modern topography and four topographies with reconstructed bed elevations (A–D). We simulated the hydraulic conditions through inferred stages of canyon evolution that we reconstructed from terraces. The reconstructed bed elevations were based on the elevations of bedrock terraces found downstream of a prominent knickzone and maintain the same bed slope as the channel upstream from the knickzone (Fig. 1; Extended Data Fig. 2). Although we do not explicitly simulate lateral knickpoint retreat, which probably accounted for much of the erosion of Moses Coulee, the decrease in bed elevation that we impose for different simulations mimics the vertical bed lowering that would have occurred as knickpoints migrated upstream, past the reach with preserved terraces. If slopes were steeper than the modern topography during canyon incision and erosion was limited to steep reaches, then the discharges predicted by our threshold shear stress model are upper estimates; the hydraulic simulations indicate that even the smallest floods we simulated produce shear stresses across the knickzone that exceed the threshold for block sliding (Extended Data Fig. 3). 

The topographic data were from 10-m digital elevation model (DEM) data 
derived from US Geological Survey (USGS) topographic maps. DEM and recon- 
structed topographic data were used by ANUGA to generate the triangular mesh. 

ANUGA implements bed friction with Manning's roughness coefficient ($n$). We determine the roughness coefficient by equating a form of the Manning–Strickler relation 

$$\frac{u}{u_*} = 8.1\left(\frac{h}{k_s}\right)^{1/6}$$

and the Manning equation 

$$u = \frac{h^{2/3} S^{1/2}}{n}$$

and re-arranging for $n$ (with $u_* = \sqrt{ghS}$ for steady, uniform flow): 

$$n = \frac{k_s^{1/6}}{8.1\, g^{1/2}}$$

Here $u$ is flow velocity, $u_* = \sqrt{C_f}\,u$ is bed shear velocity, $C_f$ is a friction coefficient, $g$ is acceleration due to gravity, $h$ is flow depth, $S$ is the tangent of the channel-bed angle and $k_s$ is a bed-roughness length scale. We assume that $k_s$ follows a relation proposed for bedrock channels: 

$$k_s = r_d\, r_{br}\, \sigma_{br}$$

where $r_d$ and $r_{br}$ are hydraulic roughness scaling parameters and $\sigma_{br}$ is one standard deviation of the elevation of the bedrock bed. We set $r_d = 2$ and assume $r_{br} = 2$, which is within the range of reported values. The mean of $\sigma_{br}$ from five different reaches in our study area was about 5 m; consequently, we determined that $n = 0.065$ and assume that this roughness coefficient is spatially uniform. The USGS 10-m DEM was used to determine roughness. 
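
As a check on the roughness calculation above, the following minimal sketch recomputes Manning's n from the stated parameter values. It assumes the Manning–Strickler coefficient of 8.1 and the product form of the roughness length scale (two scaling parameters times the bed-elevation standard deviation); the function and variable names are hypothetical and chosen for illustration.

```python
import math

G = 9.81  # acceleration due to gravity (m s^-2)

def roughness_length(r_d, r_br, sigma_br):
    """Bed-roughness length scale (m), assumed to be the product of two
    scaling parameters and the bed-elevation standard deviation."""
    return r_d * r_br * sigma_br

def manning_n(k_s, coeff=8.1, g=G):
    """Manning's n obtained by equating the Manning-Strickler relation
    (assumed coefficient 8.1) with the Manning equation."""
    return k_s ** (1.0 / 6.0) / (coeff * math.sqrt(g))

k_s = roughness_length(2.0, 2.0, 5.0)  # scaling parameters of 2, sigma of 5 m
n = manning_n(k_s)
print(f"k_s = {k_s:.1f} m, n = {n:.3f}")
```

With both scaling parameters set to 2 and a 5 m standard deviation, the sketch gives a roughness length of 20 m and a Manning's n that rounds to the 0.065 quoted in the text.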

For each flood simulation, a constant discharge was input at the upstream boundary of the model domain and flow evolved within the domain before exiting at the downstream boundary. The duration of each simulation was 10,000 s, which was sufficient for the flow to evolve to near steady-state (Extended Data Fig. 4). Model outputs for flood elevation (stage) and the x and y components of momentum and velocity were saved every 100 s and gridded to a pixel resolution of 30 m × 30 m, a value selected because it is comparable to the maximum triangle area in the study reach. The 10 grids from the final 1,000 s of each simulation were averaged. From the averaged grids, flow depth ($h$) was calculated as the difference in mean flood stage and bed topography, streamwise velocity ($u$) was determined by the vector sum of the x and y velocity components, and bed shear stress ($\tau_b$) was calculated as $\tau_b = \rho C_f u^2$, where $\rho$ is the density of water. The friction coefficient is related to Manning's $n$ by 

$$C_f = \frac{g n^2}{h^{1/3}}$$
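
The post-processing steps described above (time-averaging the saved outputs, then deriving depth, velocity and bed shear stress) can be sketched for a single grid cell as follows. This is an illustrative sketch, not the authors' code: the function names are hypothetical, and the friction coefficient is assumed to follow from Manning's n as the product of g and n squared divided by the cube root of depth.

```python
import math

RHO_WATER = 1000.0  # density of water (kg m^-3)
G = 9.81            # acceleration due to gravity (m s^-2)

def mean(values):
    return sum(values) / len(values)

def friction_coefficient(n, h, g=G):
    """Friction coefficient from Manning's n and flow depth h (m)."""
    return g * n ** 2 / h ** (1.0 / 3.0)

def bed_shear_stress(stages, bed, ux_series, uy_series, n=0.065, rho=RHO_WATER):
    """Bed shear stress (Pa) for one cell from time series of stage (m)
    and x/y velocity components (m s^-1), averaged before use."""
    h = mean(stages) - bed                      # depth = mean stage - bed elevation
    if h <= 0.0:
        return 0.0                              # dry cell
    u = math.hypot(mean(ux_series), mean(uy_series))  # velocity magnitude
    return rho * friction_coefficient(n, h) * u ** 2

# Illustrative cell: 50 m deep uniform flow on a slope of 0.005, with the
# velocity taken from the Manning equation (ten saved outputs, all equal).
h0, slope, n0 = 50.0, 0.005, 0.065
u0 = h0 ** (2.0 / 3.0) * math.sqrt(slope) / n0
tau = bed_shear_stress([450.0] * 10, 400.0, [u0] * 10, [0.0] * 10, n=n0)
```

For uniform flow the result reduces to the depth-slope product (density times g times depth times slope), which offers a quick consistency check on the two relations.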


Discharge predicted by the brim-full model. The discharge predicted by the brim-full model was determined by iteratively simulating floods in discharge increments of 0.1 × 10⁶ m³ s⁻¹ through the modern topography and identifying the discharge scenario in which the simulated flood stage reached the brim-full level determined from field evidence. Field evidence for the elevation of the brim-full level was a scarp cut into loess with rounded basalt clasts deposited at its base. 
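
The iterative search for the brim-full discharge can be sketched as a simple loop over discharge increments. The hydraulic model itself is replaced here by a hypothetical stand-in, `toy_stage`, which is an invented rating curve used only so the sketch runs; in practice each call would be a full two-dimensional simulation. The increment size follows the text, and the brim elevation and search ceiling are illustrative assumptions.

```python
def brim_full_discharge(simulate_stage, brim_level, step=0.1e6, q_max=10.0e6):
    """Return the smallest simulated discharge (m^3 s^-1) whose flood stage
    reaches `brim_level`, stepping in fixed increments; None if never reached."""
    k = 1
    while k * step <= q_max:
        q = k * step
        if simulate_stage(q) >= brim_level:
            return q
        k += 1
    return None

def toy_stage(q, bed=400.0, a=0.0227, b=0.6):
    """Hypothetical rating curve standing in for a hydraulic simulation:
    stage (m) rises as a power law of discharge above the bed elevation."""
    return bed + a * q ** b

STEP = 0.1e6
BRIM = 575.0  # illustrative brim elevation, 175 m above the toy bed
q_bf = brim_full_discharge(toy_stage, BRIM, step=STEP)
```

Because the search uses fixed increments, the returned discharge is resolved only to within one increment of the true brim-full value.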
Discharge predicted by the threshold shear stress model. Theoretical and field evidence from gravel-bedded rivers demonstrates that bed shear stresses can exceed the threshold for sediment transport by a factor of only about 1.2 on average before the channel cross-section erodes and evolves to a form that maintains shear stresses near the threshold for sediment transport. Ultimately, to form a stable alluvial channel, the channel banks must be below the threshold for erosion, whereas the bed must be above the threshold for transport. If the bed and banks are composed of the same material, which is the case both for gravel-bed rivers and the Channeled Scablands, then the channel-forming shear stress for self-formed channels cannot greatly exceed the conditions required for sediment entrainment. The similarity in low excess shear stress (factor of 1.1–1.6) required to erode gravel-bedded channels and the channel at Canyon Lake Gorge, Texas, suggests that the mechanics of erosion and sediment transport in gravel-bedded rivers are similar to those in bedrock channels formed in well-jointed bedrock, where plucking of bedrock blocks is the dominant erosion mechanism, leading to the threshold shear stress end-member model for canyon formation. A similar threshold stress mechanism has also been proposed at waterfalls dominated by toppling, which are common to Scabland terrain. 
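
One way to picture the threshold criterion is to scan a set of simulated discharges and pick the first one whose reach-averaged bed stress reaches the plucking threshold, then check how far it exceeds it. The sketch below does this under stated assumptions: the selection rule, the function name, the stress values and the critical stress are all hypothetical illustrations, and the excerpt does not specify the exact rule the authors applied.

```python
def threshold_discharge(runs, tau_c, max_excess=1.2):
    """Pick the smallest discharge whose reach-averaged bed stress reaches tau_c.

    runs: iterable of (discharge, reach-averaged bed shear stress) pairs.
    Returns (discharge, excess ratio, within_band), where within_band is True
    when the excess ratio does not exceed `max_excess`; None if no run
    reaches the threshold.
    """
    for q, tau_b in sorted(runs):
        ratio = tau_b / tau_c
        if ratio >= 1.0:
            return q, ratio, ratio <= max_excess
    return None

# Hypothetical simulation summary: discharge (m^3 s^-1) and mean stress (Pa).
runs = [(0.1e6, 120.0), (0.2e6, 210.0), (0.3e6, 290.0),
        (0.4e6, 330.0), (0.5e6, 390.0)]
result = threshold_discharge(runs, tau_c=300.0)
```

In this toy example the 0.4 × 10⁶ m³ s⁻¹ run is selected with an excess ratio of about 1.1, inside the factor-of-1.2 band discussed in the text.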

The threshold shear stress model predicts that channel bed stresses should be at or near the threshold for plucking, but local deviations are observed (Fig. 1) and expected for a variety of reasons, consistent with observations in threshold alluvial rivers. Anomalies are likely to occur where areas of the bed are eroding faster, such as in knickzones, in areas where heterogeneity in rock strength both within and among basalt flows leads to differing fracture spacing and block size (leading to, for example, differences in canyon width), and in areas where the formation of backwaters locally lowers bed stresses. For example, our simulations assume a constant block size of 0.5 m; accounting for a wider range of block sizes would produce a correspondingly wider range of expected threshold stresses for plucking (equation (1)). The threshold shear stress model predicts that, over time, the topography is likely to evolve such that bed protrusions are eroded and knickpoints are transient and retreat, such that shear stresses, on average, tend towards the threshold values for erosion. For Moses Coulee, 67% of the terrain has simulated bed stresses within the bounds for plucking blocks of the size found in depositional bars in the Channeled Scablands (0.13–0.83 m), which is consistent with the threshold shear stress model; for brim-full flow, only 33% of the terrain has simulated bed stresses within the plucking bounds for the same range of block sizes (Extended Data Fig. 5). 

Plucking—the removal of bedrock blocks—is the dominant erosion mechanism 
in the well-jointed basalt bedrock of the Channeled Scablands. Experimental
studies have shown that downstream sliding of blocks can be the dominant form
of plucking. Therefore, to evaluate the discharges predicted by the threshold
shear stress model, we first determined the dimensionless critical shear stress for 
block sliding: 


where θ is the bed angle, φ is the bed friction angle, τ*_w is the dimensionless block
sidewall stress, C_D is the local drag coefficient, P is the block protrusion height (or
equivalently the roughness on the top of the block), L is the block length and F*_L is
the dimensionless hydraulic lift force. We set θ = 1° on the basis of measurements
of the topographic slope outside of the channel upstream from the terraces in
Moses Coulee. The friction angle was assumed to be 34°, consistent with the range
of values (31°-36°) for wet basalt. The ratio of block protrusion height to block
length (P/L) was set to 0.2 on the basis of field observations. The quantities C_D,
u/u* and F*_L were taken to be 1, 8.3 and 0.85, respectively. Dimensionless block
sidewall stress is poorly constrained, so we varied the values between 0 and 0.2 to
predict a range of potential values for τ*_pc. The Columbia River basalts are jointed
and fractured, commonly exhibiting colonnade and entablature structures. The
fractures reduce the cohesive strength of the Columbia River basalts by one or two
orders of magnitude relative to intact basalt. A dimensionless block sidewall stress
of 0 is appropriate for the wide, open joints we observed between basalt columns
while in the field, and 0.2 corresponds to interlocking or cohesion between blocks
with a wall stress that is 20% of the block weight per unit area. Greater assumed
cohesive strength or interlocking would lead to higher discharge estimates for the
threshold shear stress model. The range of τ*_pc was then used to calculate upper
and lower bounds on the critical shear stress for block sliding:

$$\tau_{pc} = \tau^{*}_{pc}\,(\rho_r - \rho)\,g\,D$$

where ρ_r is the density of the basalt (2,800 kg m⁻³), ρ is the density of water,
g is the acceleration due to gravity and D is the block height.
We set D = 0.5 m, a value consistent with the dimensions of basalt columns in the
Channeled Scablands and that falls within the range of median intermediate-axis
diameters (D₅₀) of boulders we measured on the Ephrata Fan (D₅₀ = 0.30 m, sample
size N = 138) and in Drumheller Channels (D₅₀ = 0.59 m, N = 301) (Extended Data
Fig. 6). The Ephrata Fan and Drumheller Channels sites are located in a different
floodway approximately 30 km and 70 km to the southeast of Moses Coulee,
respectively. The lower and upper shear stress bounds for erosion by sliding
calculated by this method (based on varying the end-member block sidewall stress
values from 0 to 0.2) are 467 Pa and 751 Pa, respectively.

Unlike the brim-full discharge model, which predicts discharge for the length
of an entire channel reach, the threshold shear stress model makes predictions
for local bed shear stress within the channel. We therefore determined the mean
and standard deviation of shear stress at 12 locations placed approximately along
a cross-section within the study reach, which span an elevation range from the
channel floor to the highest terrace (Extended Data Fig. 7). To determine the
discharges predicted by the threshold shear stress model, we used the upper
and lower shear stress bounds for plucking by sliding shown in Extended Data
Fig. 8 to define the widest possible bounds on flood discharge that satisfy the model
conditions for all eroding sites.

For a given topographic reconstruction, corresponding to the modern
topography or to one of our four inferred stages of canyon incision, we first
used the upper and lower thresholds for plucking to define the range of possible
discharges at the lowest elevation site that are consistent with the threshold
shear stress model. For the modern topography, for example, the lowest site corresponds
to the modern canyon floor (Extended Data Fig. 7). We then found the
discharge range, following the same procedure, at sites with progressively higher
elevations that produced bed stresses that were within the bounds for plucking
at the site of interest and also for all lower elevation sites (Extended Data Fig. 9).
These two criteria are important because, for a flood discharge to be consistent
with the threshold stress model, all inundated and eroding sites from the canyon
floor upward must have bed stresses that are within the bounds for plucking. Most
brim-full discharge scenarios violate these constraints because they produce bed
stresses on the canyon floor that greatly exceed the bounds on plucking (Extended
Data Fig. 8). For the threshold shear stress model, if the discharge required to pluck
blocks at a higher elevation site exceeds that of a lower elevation site, then the
higher sites were considered to be dry and abandoned and therefore were not used
to define the range of discharges for that stage of canyon incision (Extended Data
Fig. 9). This analysis therefore allows for identification, for a given topographic
reconstruction, of the maximum range in flood discharges that is consistent with
the threshold shear stress model, for both the canyon floor and all higher elevation
surfaces that could have been inundated. The procedure was repeated for the
modern topography and the topographies for each of the four stages of canyon
incision to define discharges, flow depths, cross-sectional areas and channel widths
(Fig. 3, Extended Data Fig. 10).
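
The site-by-site procedure described above can be illustrated with a minimal Python sketch. It is not the authors' script: the function names, the use of a discrete list of simulated discharges, and the example stresses are all hypothetical, and only the 467 Pa and 751 Pa bounds come from the text. The sketch assumes that a higher site whose acceptable discharges do not overlap those of the lower sites is treated as dry and abandoned.

```python
from typing import List, Optional, Sequence, Tuple

# Lower and upper shear stress bounds for plucking by sliding, from the text (Pa).
TAU_LOWER = 467.0
TAU_UPPER = 751.0

Range = Tuple[float, float]


def site_discharge_range(
    discharges: Sequence[float],
    stresses: Sequence[float],
    tau_lower: float = TAU_LOWER,
    tau_upper: float = TAU_UPPER,
) -> Optional[Range]:
    """Smallest and largest simulated discharge whose mean bed stress at one
    site lies within the plucking bounds, or None if no discharge qualifies."""
    ok = [q for q, tau in zip(discharges, stresses) if tau_lower <= tau <= tau_upper]
    return (min(ok), max(ok)) if ok else None


def threshold_discharge_range(
    discharges: Sequence[float],
    stresses_by_site: List[Sequence[float]],
) -> Optional[Range]:
    """Discharge range consistent with the threshold model.

    Sites must be ordered from the canyon floor upward (assumption of this sketch).
    """
    if not stresses_by_site:
        return None
    current = site_discharge_range(discharges, stresses_by_site[0])
    if current is None:
        return None
    for stresses in stresses_by_site[1:]:
        site = site_discharge_range(discharges, stresses)
        if site is None:
            continue  # site never reaches the plucking bounds
        low, high = max(current[0], site[0]), min(current[1], site[1])
        if low > high:
            # No overlap with the lower sites: treated as dry and abandoned.
            continue
        current = (low, high)
    return current


# Hypothetical mean bed stresses (Pa) at three sites for seven simulated
# discharges (in units of 10^6 m^3 s^-1); these are not data from the study.
discharges = [0.2, 0.3, 0.4, 0.5, 0.6, 0.8, 1.0]
stresses_by_site = [
    [350, 480, 560, 640, 720, 900, 1100],  # canyon floor
    [200, 380, 470, 540, 610, 760, 950],   # lower terrace
    [0, 0, 150, 300, 420, 600, 800],       # highest terrace
]
print(threshold_discharge_range(discharges, stresses_by_site))
```

In this made-up example the highest terrace only reaches the plucking bounds at a discharge that over-stresses the canyon floor, so it is excluded and the range is set by the two lower sites.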

Thresholds for sediment transport and suspension. The abandoned channel on
the south side of the study reach contains a gravel-boulder bar with well-rounded
clasts (Fig. 2b, c). We measured the intermediate diameter of 220 individual clasts
at 1-m spacing along a grid (excluding loess deposited after flooding)
and calculated D₅₀ = 0.15 m and D₈₄ = 0.41 m (84% of clast diameters
are finer than this size) (Extended Data Fig. 6). We calculated the critical shear
stress for initial sediment motion (τ_c) by substituting τ*_c = 0.045 for τ*_pc and D₅₀
for D in equation (1). The resulting critical shear stress for initial motion is 119 Pa.
The threshold for sediment suspension was calculated as τ_s = ρ(0.4w_s)², where
w_s, the terminal settling velocity, is defined as

$$w_s = \frac{R g D^{2}}{C_1 \nu + \left(0.75\, C_2 R g D^{3}\right)^{0.5}}$$

in which R is submerged specific gravity (1.8 for basalt in water), C₁ = 20 and C₂ = 1.1
are constants representing natural sediment, and ν is the kinematic viscosity of
water. The shear stress threshold for suspension of D₅₀ is 513 Pa and for D = 0.5 m is
1,712 Pa, which is similar to the modelled brim-full shear stress at the bar for the modern
topography (1,706 Pa), but inconsistent with bar deposition by bedload transport.
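
As a rough check on the thresholds quoted above, the following Python sketch evaluates the initial-motion and suspension stresses. It assumes a water density of 1,000 kg m⁻³, g = 9.81 m s⁻² and a kinematic viscosity of 10⁻⁶ m² s⁻¹, none of which are stated in the text; the function names are illustrative only.

```python
import math

RHO_WATER = 1000.0   # kg m^-3 (assumed, not stated in the text)
RHO_BASALT = 2800.0  # kg m^-3 (from the text)
G = 9.81             # m s^-2 (assumed)
NU = 1.0e-6          # m^2 s^-1, kinematic viscosity of water (assumed)
R = (RHO_BASALT - RHO_WATER) / RHO_WATER  # submerged specific gravity, 1.8
C1, C2 = 20.0, 1.1   # constants for natural sediment (from the text)


def initial_motion_stress(d: float, shields: float = 0.045) -> float:
    """Critical shear stress (Pa) for initial motion of a grain of diameter d (m)."""
    return shields * (RHO_BASALT - RHO_WATER) * G * d


def settling_velocity(d: float) -> float:
    """Terminal settling velocity (m/s) for a grain of diameter d (m)."""
    return R * G * d**2 / (C1 * NU + math.sqrt(0.75 * C2 * R * G * d**3))


def suspension_stress(d: float) -> float:
    """Shear stress threshold (Pa) for suspension: rho * (0.4 * w_s)^2."""
    return RHO_WATER * (0.4 * settling_velocity(d)) ** 2


for label, value in [
    ("initial motion, D50 = 0.15 m", initial_motion_stress(0.15)),
    ("suspension, D50 = 0.15 m", suspension_stress(0.15)),
    ("suspension, D = 0.5 m", suspension_stress(0.5)),
]:
    print(f"{label}: {value:.1f} Pa")
```

Under these assumed constants the three values fall within about 1 Pa of the 119 Pa, 513 Pa and 1,712 Pa reported in the text.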
Code availability. The hydrodynamic code ANUGA is open-source and 
available for download at https://anuga.anu.edu.au/. The PYTHON scripts used 
to implement ANUGA are available from the authors by request. 

Data availability. Digital elevation data are available from the University of
Washington Geomorphological Research Group website (http://gis.ess.washington.edu/data/).
All simulation results and data are available from the authors by request.


31. Roberts, S. G., Nielsen, O. M. & Jakeman, J. in Modeling, Simulation and
Optimization of Complex Processes (eds Bock, H. G. et al.) 489-498 (Springer,
2008).

32. Roberts, S., Nielsen, O., Gray, D., Sexton, J. & Davies, G. ANUGA User Manual
Release 2.0. https://github.com/GeoscienceAustralia/anuga_core/raw/master/doc/anuga_user_manual.pdf (2015).

33. Parker, G. Selective sorting and abrasion of river gravel. II: applications.
J. Hydraul. Eng. 117, 150-171 (1991).

34. Johnson, J. P. A surface roughness model for predicting alluvial cover and bed
load transport rate in bedrock channels. J. Geophys. Res. Earth Surface 119,
2147-2173 (2014).

35. Dubinski, I. M. & Wohl, E. Relationships between block quarrying, bed shear
stress, and stream power: a physical model of block quarrying of a jointed
bedrock channel. Geomorphology 180-181, 66-81 (2013).

36. Selby, M. J. Hillslope Materials and Processes 93 (Oxford Univ. Press, 1993).

37. Long, P. E. & Wood, B. J. Structures, textures, and cooling histories of Columbia
River basalt flows. Geol. Soc. Am. Bull. 97, 1144-1155 (1986).

38. Schultz, R. Limits on strength and deformation properties of jointed basaltic
rock masses. Rock Mech. Rock Eng. 28, 1-15 (1995).

39. Ehlmann, B. L., Viles, H. A. & Bourke, M. C. Quantitative morphologic analysis
of boulder shape and surface texture to infer environmental history: a case
study of rock breakdown at the Ephrata Fan, Channeled Scabland, Washington.
J. Geophys. Res. Earth Surface 113, F02012 (2008).

40. Niño, Y., Lopez, F. & Garcia, M. Threshold for particle entrainment into
suspension. Sedimentology 50, 247-263 (2003).

41. Ferguson, R. & Church, M. A simple universal equation for grain settling
velocity. J. Sediment. Res. 74, 933-937 (2004).


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved 




Extended Data Figure 1 | Computational mesh resolution. a, Map of the
computational domain (green outline). Black square shows the location
of b. b, Map showing the different triangular mesh resolutions within the
computational domain, with a maximum triangle area of 900 m² within
the white polygon in a and 5,000 m² throughout the rest of the domain.
We used a smaller triangle area in the study reach (within the white
polygon) to better resolve spatial variability in shear stresses and a slightly
larger triangle area elsewhere for computational efficiency.
Extended Data Figure 2 | Hillshade m:

bed elevation. These maps depict the reconstructed topography in Moses
Coulee, which is based on the elevation of terraces and the modern
channel bed upstream from the knickzone. a, b, Bed elevation D; c, d,
bed elevation C; e, f, bed elevation B; g, h, bed elevation A; i, j, modern
topography.
Extended Data Figure 3 | Bed shear stress as a function of discharge for sites on the Moses Coulee knickzone. Data correspond to locations shown
in Extended Data Fig. 7. For nearly all simulated discharges, the modelled shear stresses on the knickzone exceed the thresholds for plucking via block
sliding, indicating that the knickzone would probably have been rapidly eroding during floods or that it is made up of stronger rock.
Extended Data Figure 4 | Flood depth versus time for a typical simulation. The example shown is a flood with a discharge of 10⁶ m³ s⁻¹ routed through
the modern topography. The grey box shows the final ten time-steps, from which the model results (stage, velocity, shear stress and so on) were extracted
and used to produce time-averaged grids to reduce the influence of transient waves relative to data from a single time-step.
Extended Data Figure 5 | Cumulative bed stress probability density
functions. The light grey bar shows the lower (467 Pa) and upper
(751 Pa) threshold shear stress bounds for plucking of 0.5-m blocks.
The dark grey bar shows the threshold stress bounds (117 Pa and 1,242 Pa)
assuming a wider distribution of block sizes of 0.13-0.83 m, based on
the D₁₆ and D₈₄ (16th and 84th percentiles) of basalt clasts at the Ephrata
Fan and Drumheller Channels sites, respectively (Extended Data Fig. 6).
Cumulative probability density functions are shown for the brim-full flood
(discharge of 3.0 × 10⁶ m³ s⁻¹; black line) and the lower (0.3 × 10⁶ m³ s⁻¹;
red line) and upper (0.6 × 10⁶ m³ s⁻¹; blue line) flood discharge bounds
predicted by the threshold shear stress model for the modern Moses
Coulee topography. Data are extracted from each grid cell within the
entire length of Moses Coulee (Fig. 1b). For the brim-full flood, only 8%
of the terrain of Moses Coulee has modelled bed shear stresses within
the bounds for plucking 0.5-m blocks, and this increases to 33% when
considering the wider range of block sizes; some areas have extremely high
bed stresses of more than 10,000 Pa. In contrast, for discharges predicted
by the threshold shear stress model, 25% of the terrain has shear stresses
within the bounds for plucking 0.5-m blocks, and this increases to 67%
when considering a wider distribution of block sizes. This high proportion
of bed stresses within the plucking threshold range, relative to the brim-full
flood, is consistent with the hypothesis that the channel adjusts so that
a large portion of the terrain is near the threshold state during megaflood
incision. Note that the study locations in Extended Data Fig. 7 were used
to define the discharge bounds for the threshold shear stress model, so all
of those locations have bed stresses within the plucking threshold range by
definition (see Extended Data Fig. 8).
Extended Data Figure 6 | Boulder size data. Cumulative grain-size
distribution for the Moses Coulee abandoned channel boulder bar
(circles), Ephrata Fan basalt clasts (squares) and the Drumheller Channels
boulder bar (triangles). Measurements were made of the intermediate
grain diameter using a Wolman-style pebble count. The Ephrata Fan and
Drumheller Channels sites are to the southwest of Moses Coulee and in
a different flood pathway. The larger boulders at the Ephrata Fan and
Drumheller Channels are assumed to originate as basalt columns that have
been rounded by fluvial transport.
Extended Data Figure 7 | Locations of the areas used to calculate mean bed shear stress. The 30,000-m² rectangles near the A–A′ cross-section were
used to constrain discharges predicted by the threshold shear stress model. The larger 90,000-m² polygons (labelled 1-6) were used to determine the
shear stresses on the knickzone; the numbers correspond to the data in Extended Data Fig.
and lower shear stress thresholds for block sliding. Round symbols
depict the mean and bars depict one standard deviation (n = 33) of bed
shear stress values extracted from one of the twelve 30,000-m² polygons
shown in Extended Data Fig. 7. In this example, the lower discharge bound [...]
discharge for which modelled shear stresses exceed the lower shear stress
threshold for sliding. The high discharge bound (blue symbol) is defined
as the lowest discharge with modelled shear stresses that exceed the upper
shear stress threshold for sliding.
Extended Data Figure 9 | Schematic illustrating the method for
defining the discharge range predicted by the threshold shear stress
model. For a given topographic reconstruction we first used the upper
and lower bed stress thresholds for block plucking via sliding to define
the range of possible discharges that are consistent with the threshold
stress model at the lowest elevation site (Extended Data Fig. 7). The
lowest elevation was always the canyon floor. We then followed the same
procedure to determine the discharge range at sites with progressively
higher elevations (for example, sites 2-5) that produced bed stresses that
were within the bounds for plucking at the site of interest and for all lower
elevation sites. The range of discharges determined by these two criteria
is shown by the green box. These two criteria are important because, for
a flood discharge to be consistent with the threshold shear stress model,
all sites that are inundated and eroding, from the canyon floor upward,
must have bed stresses that are within the bounds for plucking. If the
discharge required to pluck blocks at a higher elevation site exceeds that of
a lower elevation site, then the higher sites were considered to be dry and
abandoned and so were not used to define the range of discharges for that
stage of canyon incision.
Extended Data Figure 10 | Predicted channel cross-sectional areas and
widths. a, b, Cross-sectional area (a) and channel width (b) predicted
by the brim-full model and threshold shear stress model. The shading
shows the range of predicted values based on upper- and lower-bound
parameterizations of the critical dimensionless shear stress for bedrock
incision by block sliding (see Methods). For the initial topography (bed
elevation D), where basalt is primarily overlain by loess, modelled shear
stresses are lower than the threshold for plucking. At bed elevation C,
modelled shear stresses at brim-full discharge are within 5% of the lower
threshold for plucking; hence, bed stresses are assumed to be sufficient for
plucking when flow was brim-full. For all other bed topographies, brim-full
discharge greatly exceeds predicted values for plucking and sediment
initial motion. The letters A-D denote simulation results for bed elevations
A-D; M denotes results for the modern topography.
The phylogenetic roots of human lethal violence


José María Gómez, Miguel Verdú, Adela González-Megías & Marcos Méndez


The psychological, sociological and evolutionary roots of 
conspecific violence in humans are still debated, despite attracting 
the attention of intellectuals for over two millennia. Here we
propose a conceptual approach towards understanding these roots 
based on the assumption that aggression in mammals, including 
humans, has a significant phylogenetic component. By compiling 
sources of mortality from a comprehensive sample of mammals, 
we assessed the percentage of deaths due to conspecifics and, 
using phylogenetic comparative tools, predicted this value for 
humans. The proportion of human deaths phylogenetically 
predicted to be caused by interpersonal violence stood at 2%. 
This value was similar to the one phylogenetically inferred for 
the evolutionary ancestor of primates and apes, indicating that a 
certain level of lethal violence arises owing to our position within 
the phylogeny of mammals. It was also similar to the percentage 
seen in prehistoric bands and tribes, indicating that we were as 
lethally violent then as common mammalian evolutionary history 
would predict. However, the level of lethal violence has changed 
through human history and can be associated with changes in 
the socio-political organization of human populations. Our study 
provides a detailed phylogenetic and historical context against 
which to compare levels of lethal violence observed throughout 
our history. 

Debate on the nature of human violence has been ongoing since
before the publication of Leviathan by Thomas Hobbes in 1651.
Lethal violence is considered by some to be mostly a cultural trait;
however, aggression in mammals, including humans, also has a
genetic component with high heritability. Consequently, it is widely
acknowledged that evolution has also shaped human violence.
From this perspective, violence can be seen as an adaptive strategy,
favouring the perpetrator’s reproductive success in terms of mates,
status or resources. Yet this does not mean that violence is invariant
or even adaptive in all situations. In fact, given that the conditions
under which violence benefits evolutionary fitness depend on the
ecological and cultural context, levels of violence tend to vary among
human populations. Disentangling the relative importance
of cultural and non-cultural components of human violence is
challenging owing to the complex interactions between ecological,
social, behavioural and genetic factors.

Conspecific violence is not exclusive to humans. Many primates
exhibit high levels of intergroup aggression and infanticide. Social
carnivores sometimes kill members of other groups and commit
infanticide when supplanting older members of the same group.
Even seemingly peaceful mammals such as hamsters and horses
sometimes kill individuals of their own species. The prevalence
of aggression throughout Mammalia raises the question of the extent
to which levels of lethal violence observed in humans are as expected,
given our position in the phylogenetic tree of mammals. In this study,
we quantified the level of lethal violence in 1,024 mammalian species
from 137 families (Supplementary Information section 9a) and in over
600 human populations, ranging from the Palaeolithic era to the present
(Supplementary Information section 9c). The level of lethal violence
was defined as the probability of dying from intraspecific violence
compared to all other causes. More specifically, we calculated the level
of lethal violence as the percentage, with respect to all documented
sources of mortality, of total deaths due to conspecifics (these
were infanticide, cannibalism, inter-group aggression and any other
type of intraspecific killings in non-human mammals; war, homicide,
infanticide, execution, and any other kind of intentional conspecific
killing in humans).
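
The definition above reduces to a simple ratio. A minimal Python sketch, with a hypothetical function name and made-up counts that are not taken from the study's data, is:

```python
def lethal_violence_percent(conspecific_deaths: int, total_deaths: int) -> float:
    """Percentage of all documented deaths that were caused by conspecifics."""
    if total_deaths <= 0:
        raise ValueError("total_deaths must be positive")
    if not 0 <= conspecific_deaths <= total_deaths:
        raise ValueError("conspecific_deaths must lie between 0 and total_deaths")
    return 100.0 * conspecific_deaths / total_deaths


# Hypothetical sample: 4 of 200 documented deaths attributed to conspecifics.
print(lethal_violence_percent(4, 200))
```

A sample with no recorded conspecific killings gives 0%, which is how species without documented lethal violence enter the overall mean.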

Lethal violence is reported for almost 40% of the studied mammal
species (Supplementary Information section 9a). This is probably
an underestimation, because information is not available for many
species. Overall, including species with and without lethal violence,
we found that the percentage of deaths due to conspecifics was
0.30 ± 0.19% of all deaths (phylogenetically corrected mean ± s.e.m.).
This level was not affected by the number of individuals sampled
per species (Supplementary Information section 1). These findings
suggest that lethal violence, although infrequent, is widespread among
mammals.

We determined whether related species tended to have similar levels
of lethal violence by calculating the phylogenetic signal. We used the
most recently updated mammalian phylogenies, including 5,020
extant mammals and 5,747 extant and recently extinct mammals.
We found a significant phylogenetic signal for lethal violence, even
after combining disparate causes of intraspecific killings (λ > 0.60,
P < 0.0001; Supplementary Information section 2). While lethal
violence was uncommon in certain clades such as bats, whales and
lagomorphs, it was frequent in others, such as primates (Fig. 1). The
phylogenetic signal was also significantly lower than one (P < 0.0001),
indicating that lethal violence exhibits certain evolutionary flexibility
(Fig. 1). For example, the level of lethal violence strongly
differs between chimpanzees (Pan troglodytes) and bonobos
(Pan paniscus). This outcome suggests that additional factors
may subsequently modify the level of lethal violence in related species.
Territoriality and social behaviour mediate conspecific aggression in
mammals. We scored these two traits for every mammal in our
study and statistically related them to the level of lethal violence using
phylogenetic generalized linear models. Using this method, we found
that the level of lethal violence was higher in social and territorial
species than in solitary and non-territorial species (Fig. 2; Extended
Data Table 1).

The occurrence of a phylogenetic signal for lethal violence in
mammals enables the phylogenetic inference of lethal violence in
humans. We used ancestral-state estimation methods that infer the
value of a trait in any extant species according to its position in the
phylogenetic tree. The level of human lethal violence was estimated
both with and without considering the territoriality and sociability of
mammals. Because phylogenetic inferences are much more accurate
and reliable when including information from close relatives
and fossils, information on Homo neanderthalensis was included
when estimating the level of human lethal violence (Supplementary
Information section 9b). In addition, because the level of violence
varies among populations of the same species, all models
include intraspecific variation in the level of mammalian lethal
violence. The phylogenetically inferred level of lethal violence, averaging
across all models, was 2.0 ± 0.02% of all deaths (Fig. 3a). These estimates
seem to be robust to many potential biases, such as phylogenetic
uncertainty, phylogenetic depth, sampling effort, and phylogeny size
(Supplementary Information sections 3-6). Territoriality and sociability
affect the phylogenetic inference of the level of lethal violence, as it was
1.9 ± 0.01% in the models without these two variables but 2.1 ± 0.02%
in the models including them (Fig. 3a). This is a consequence of
H. sapiens being both social and territorial, two characteristics
associated with a stronger tendency towards lethal violence in
mammals (Fig. 2).

Figure 1 | Evolution of lethal aggression in non-human mammals.
Tree showing the phylogenetic estimation of the level of lethal
aggression in mammals (n = 1,024 species) using stochastic mapping.
Lethal aggression increases with the intensity of the colour, from
yellow to dark red. Light grey indicates the absence of lethal aggression.
Mammalian ancestral nodes compared with human lethal violence are
shown in red, whereas main placental lineages are marked with black
nodes. The red triangle indicates the phylogenetic position of humans.
The silhouettes of representative mammals (downloaded from
http://www.phylopic.org) illustrate the main mammalian clades.
They are licenced for use in the Public Domain without copyright,
except for the silhouettes of Murinae (D. Liao), Jactilus (M. Karaka),
Philander (S. Werning), Rattus (R. Groom), Molossus (Zimices),
Balaenoptera (C. Hoh), Rousettus (O. Peles), Connochaetes, Redunca,
and Kobus (J. A. Venter, H. H. T. Prins, D. A. Balfour and R. Slotow),
that are licenced under a Creative Commons 3.0 license
(http://creativecommons.org/licenses/by/3.0).

Figure 2 | Social behaviour and territoriality influence lethal
aggression in mammals. The figure shows the phylogenetically corrected
level of lethal aggression per group (mean ± s.e.m.) and the number of
mammalian species included in each group. We used a phylogenetic
generalized linear model (PGLS) to test the effect of territoriality
(yes or no) and social behaviour (social or solitary) on lethal aggression.
The level of lethal aggression was more intense in social and territorial
species (PGLS, P < 0.05 in all cases and mammal phylogenies; Extended
Data Table 1), with no interaction between these two terms (Extended
Data Table 1).

We subsequently explored how the level of lethal violence has
changed during our evolutionary history by comparing it with the
phylogenetically inferred level of lethal violence in relevant ancestral
nodes that describe the course of human evolution (Fig. 1). The level of
lethal violence was low in the most basal nodes, increasing to 2.3 ± 0.19%
of all deaths in the two nodes closely related with the origin of primates
and slightly decreasing to 1.8 ± 0.1% of all deaths in the ancestral ape
(Fig. 3b). These results suggest that lethal violence is deeply rooted in
the primate lineage.

We then tested whether the phylogenetically inferred level of
lethal violence differed from the level empirically observed in human
populations. The samples were categorized according to their age, using
the standard periods from the New and Old World chronologies.
These data must be interpreted cautiously, because there was extensive
intra-period variation in lethal violence. Nevertheless, a clear temporal
pattern emerged (Fig. 3c). The level of lethal violence during human
prehistory did not differ from the phylogenetic predictions (Fig. 3c).
This result contrasts with some previous observations, probably
because we have included more populations in our study and weighted
all the analyses by the number of individuals per sample. The level
of lethal violence during most historic periods was higher than the
phylogenetic predictions for both humans (Fig. 3c and Supplementary
Information section 7) and the ancestral Hominoidea (Fig. 3b).
However, on entering the Modern and Contemporary ages (defined
in Methods), the level of lethal violence decreased markedly, as
previously reported (Fig. 3c). Several potential biases may affect these
results. The level of lethal violence inferred from skeletal remains could
be underestimated because many deadly injuries do not damage the
bones. Nevertheless, no underestimation was detected for the periods
in which both skeletal remains and statistical yearbooks are available
(Supplementary Information section 7). Similarly, the presence of
battlefields may artificially overestimate the level of lethal violence.
However, the periods with the highest level of lethal violence were not
those with more organized intergroup conflicts (Supplementary
Information section 8). Thus, the temporal pattern in the level of
lethal violence seems to hold even after considering these potential
biases. Concomitant changes in the cultural and ecological human
environment may have caused this pattern. Notably, population density,
a common ecological driver of lethal aggression in mammals, was
lower in periods with high levels of lethal violence than in the less
violent Modern and Contemporary ages. High population density is
therefore probably a consequence of successful pacification, rather than
a cause of strife.

Socio-political organization is a factor widely invoked to explain
changes in violence. To assess this effect, we classified human
populations into four types: bands, tribes, chiefdoms and states.
Levels of lethal violence in prehistoric bands and tribes did not differ
from the phylogenetic inferences (Fig. 3d). However, lethal violence is
common in present-day bands and tribes (Fig. 3d), possibly because
there are more detailed data on mortality from living people than from
archaeological records. Nevertheless, some authors suggest that the
level of lethal violence has increased in hunter-gatherers because they
now live in denser populations in which intergroup conflicts are more
likely, or because they have contacted colonial societies where warfare
or interpersonal violence is frequent. The level of lethal violence in
chiefdoms was also higher than the phylogenetic inferences (Fig. 3d).
Severe violence has been frequently reported in chiefdoms, mostly
caused by territorial disputes, population and resource pressures, and
competition for political status. Finally, the level of lethal violence
in state societies was lower than the phylogenetic inferences (Fig. 3d).
It is widely acknowledged that monopolization of the legitimate
use of violence by the state significantly decreases violence in state
societies.

In this study, we have explored the origin and evolution of human 
lethal violence by integrating a phylogenetic approach with an empirical 
analysis of lethal violence in human populations. The phylogenetic 
analysis suggests that a certain level of lethal violence in humans 
arises from the occupation of a position within a particularly violent 
mammalian clade, in which violence seems to have been ancestrally 
present. This means that humans have phylogenetically inherited 
their propensity for violence. We believe that this phylogenetic effect 
entails more than a mere genetic inclination to violence. In fact, 
social behaviour and territoriality, two behavioural traits shared with 
relatives of H. sapiens, seem to have also contributed to the level of 
lethal violence phylogenetically inherited in humans. Our analysis of 
human lethal violence shows that lethal violence in prehistoric humans 
matches the level inferred by our phylogenetic analyses, suggesting 
that we were, at the dawn of humankind, as violent as expected 
considering the common mammalian evolutionary history. This pre- 
historic level of lethal violence has not remained invariant but has 
changed as our history has progressed, mostly associated with changes 
in the socio-political organization of human populations. This suggests 
that culture can modulate the phylogenetically inherited lethal violence 
in humans.

Figure 3 | Lethal violence in humans. a-d, Box plots showing a,

the phylogenetic inferences of human lethal violence assessed as the 
percentage of human deaths caused by conspecifics. These estimates were 
achieved through phylogenetic generalized linear models and correspond 
to the ancestral node of the tree rooted at the node separating H. sapiens 
from the rest of the mammals. All models were performed after logit- 
transforming the dependent variable and considering the intraspecific 
variation in mammal lethal aggression. Phylogenetic uncertainty was 
incorporated by using the tree provided by Fritz et al. (grey colour) and
a set of 100 randomly sampled trees from Faurby and Svenning (white
colour). b, The lethal aggression inferred for six important ancestral
nodes of human evolution (apes, primates, Euarchonta, Euarchontoglires,
placental mammals, and all mammals). c, Human lethal violence during

METHODS

No statistical methods were used to predetermine sample size. The investigators 
were not blinded to allocation during experiments and outcome assessment.
Lethal aggression in mammals. To estimate lethal aggression in mammals 
(defined as the percentage of deaths caused by conspecifics) we compiled a database 
including the amount of conspecific killing observed in many species of mammals. 
We conducted computer searches including the words (alone or in combination):
'mammal', 'mortality factors', 'causes of mortality', 'infanticide', 'death', 'conspecific',
'conspecific fighting', 'intraspecific aggression' and 'conspecific aggression',
as well as some other words related to relevant mortality factors in some mammal
species, such as 'bushmeat', 'road killing' and 'overhunting'. We pooled all sources
of conspecific mortality (active and passive infanticide, intergroup aggression, 
cannibalism and intraspecific predation, male-male fighting during mating 
period, territorial defensive behaviour, maternal abandonment, accidental injury). 
We considered only lethal conspecific interactions, ignoring non-lethal aggression, 
because the recording of aggressive interactions ending in the death of any of the 
interacting organisms, both in humans and non-human mammals, is more precise.
We found information about more than four million deaths in 1,024 mammal
species (~20% of the total species) from 137 families (~80% of total families) and
the three main extant mammalian clades (Prototheria, Metatheria and Eutheria) 
(Supplementary Information section 9a). We obtained information from several 
studies in order to incorporate the intraspecific variability in lethal aggression for 
each mammal species. For each mammal included in our database, we recorded its
territoriality (yes or no) and social behaviour (social or solitary) using information
compiled in the Animal Diversity Web (http://www.animaldiversity.org).
Mammal phylogeny. The phylogenetic relationship between the mammals
included in the database was built using the Fritz et al. and Faurby and Svenning
phylogenies, which are updated phylogenies of the supertree of Bininda-Emonds 
et al., to account for the more recent mammalian taxonomy of Wilson and
Reeder. First, we used the phylogeny provided by Fritz et al. including 5,020
extant mammals. Afterwards, we used a set of 100 phylogenies provided by Faurby
and Svenning that contains 5,747 extant and extinct mammals (including species
with dated records from the Late Pleistocene, defined as the last 130,000 years). Using
this set of phylogenies, we were able to incorporate phylogenetic uncertainty in all 
our analyses. In each phylogeny we pruned all species not included in the database 
and, in the few cases in which a species was missing in the supertree, we selected the
closest relative (usually a congeneric species; see Supplementary Information section
9a). Mortality data about subspecies were pooled at the species level.

We performed additional analyses with the inclusion of H. neanderthalensis
because: i) close relatives of modern humans can be very informative to estimate
their phylogenetically shared traits, and ii) including fossils in the phylogeny results
in more reliable ancestral state reconstructions. The Faurby and Svenning
phylogeny includes H. neanderthalensis. However, the Fritz et al. phylogeny
only contains extant species. For this reason, we grafted H. neanderthalensis 
into this latter phylogeny, indicating an evolutionary divergence from H. sapiens 
0.43 million years ago (Mya) and extinction 0.028 Mya. Although these
dates are contested, variations of a few thousand years did not significantly
alter the phylogenetic prediction of human lethal violence. For example, when 
time of divergence was changed to 0.23 Mya, the mean prediction remained the 
same but with a slightly higher confidence interval. The level of lethal violence 
in H. neanderthalensis was obtained from multiple sources (see Supplementary
Information section 9b). 

Lethal violence in humans. To estimate lethal violence in humans (defined as
the percentage of people that died owing to interpersonal violence) we compiled 
information from almost 600 human populations and societies spanning from 
the Palaeolithic to the present (Supplementary Information section 9c). Because 
of the extremely wide temporal range, we obtained information derived from 
very disparate sources, namely bioarchaeological and palaeo-osteological 
reports, ethnographic records, statistical yearbooks and verbal autopsies 
(a method to determine probable causes of death when no medical record or 
formal medical attention is available; they are performed by non-medical field 
workers, recording written narratives from reliable informants in local languages
that describe the events that preceded the death). Owing to this heterogeneity,
and because our goal was to compare the level of lethal violence in humans with 
the level of lethal aggression in mammals, we did not differentiate the specific 
causes of intraspecific mortality. Rather, we pooled together the deaths caused 
by war, homicide, manslaughter, infanticide, sacrifice, cannibalism and so on, 
without differentiating whether lethal events involved only one perpetrator or 
were coalitional and collective killings. Although it is worth investigating how
specific types of violence have evolved in humans, we could not explore this 
issue because some types of violence have been insufficiently studied, both in 
non-human mammals (for example, inter-group aggression in social mammals 
other than chimpanzees) and humans (for example, infanticide in historical
societies). Lethal violence was determined for each source using the criteria of
the researchers. Ethnographic records, statistical yearbooks and verbal autopsies 
commonly included the casualties of the interpersonal violence. The death 
toll owing to interpersonal violence in bioarchaeological studies was found by 
following the most widely used criterion in this type of study; that is, the presence 
of perimortem and blade injuries as an indication of death caused by interpersonal 
violence. This means that we did not include antemortem and healed injuries
in our calculation of lethal interpersonal violence. Nevertheless, estimates based on skeletal trauma
should be viewed as minimum estimates, since many injuries caused by conspecifics
do not damage the bones. 

The samples were categorized according to their age and socio-political
organization. To assign the age to each sample, we considered the periods used
to divide human history according to both the New World and Old World
chronologies. Old World human societies were grouped into Paleolithic
(~50,000-12,000 BP), Mesolithic (~12,000-10,200 BP), Neolithic/Chalcolithic
(~10,200-5,000 BP), Bronze Age (~5,300-3,200 BP), Iron Age (~3,200-1,300 BP)
and Medieval periods (~1,300-500 BP). New World human societies were grouped
into Archaic (~12,000-3,000 BP), Formative (~3,000-1,500 BP), Classic (~1,500-
800 BP) and Post-Classic periods (~800-500 BP). From then on, we considered
two further periods affecting human societies throughout the entire world, the
Modern Age (~500-100 BP) and the Contemporary Age (~100 BP-present day).

We followed the widely accepted socio-political classification, according to
which human societies can be classified into four types: bands (small, nomadic, 
egalitarian groups of people, usually hunter-gatherers), tribes (small, mostly 
egalitarian, groups with limited social rank usually resident in permanent villages 
as hunter-horticulturalists), chiefdoms (stratified, hierarchical non-industrial 
societies usually based on kinship) and states (politically organized complex 
societies). To assign each sample to different socio-political and temporal 
categories, we relied on the information from each original source (Supplementary
Information section 8c). The use of standard statistics to summarize information
coming from disparate sources with extremely different sample sizes and time
coverage is problematic, as has been reported. To avoid such issues, we pooled
all the samples (skeletal remains, dead individuals and so on) found during each
period (see Supplementary Information section 8c for an exhaustive list of cases,
samples and studies) and depicted them using box plots.

Phylogenetic signal of mammal lethal aggression. The phylogenetic signal for 
lethal aggression was calculated using Pagel's lambda (λ), which compares the similarity
of the covariances among species with the covariances expected under Brownian
evolution. Significant phylogenetic signal occurs when λ > 0, and λ may take values
of either 0 < λ < 1 (indicating that close relatives resemble each other less than
expected under Brownian evolution) or λ = 1 (indicating that close relatives are as
similar as would be expected under Brownian motion). Values of λ > 1 (indicating
that close relatives are more similar than expected by Brownian evolution) cannot
be reached because the off-diagonal elements in the variance-covariance matrix
cannot be larger than the diagonal elements. To account for the possibility of
a phylogenetic signal higher than expected under Brownian motion, we also
calculated Blomberg's K (that is, the ratio between the observed phylogenetic signal
and that expected under a Brownian evolution model). This phylogenetic signal
metric is not restricted in its upper limit, and ranges from 0 (no phylogenetic signal)
to infinity, with K = 1 indicating Brownian evolution. Statistical significance of
Pagel's λ was calculated through a likelihood ratio test, comparing the likelihood
of the model that was fitted to the data to that of a model in which λ was fixed to 0.
Significance of Blomberg's K was calculated through a randomization test from a
null model constructed with 1,000 random permutations of the data across the tips
of the mammal tree. Both tests were performed using the R package 'phytools'.
The level of phylogenetic signal of lethal aggression in mammals measured as
Blomberg's K (K = 0.09) was significantly higher than 0 (P = 0.013) and lower than 1
(P < 0.001). This indicates that close relatives tend to have similar values of lethal
violence but at a level lower than would be expected under Brownian evolution.
This evolutionary pattern is consistent with that shown by Pagel's lambda (λ = 0.60)
and therefore only this metric is shown in the main text. The evolution of lethal
aggression throughout the phylogeny of mammals was estimated using stochastic
mapping as implemented in the R package 'phytools'. Lethal aggression was
logit-transformed before all analyses.
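
The likelihood ratio test described above for Pagel's λ can be sketched as follows. This is a minimal illustration in Python, not the analysis code (the tests in the text were run with the R package 'phytools'). The function name and the example log-likelihoods are hypothetical, and the sketch assumes that the test statistic follows a chi-squared distribution with one degree of freedom.

```python
import math


def lambda_lrt(loglik_fitted, loglik_null):
    """Likelihood ratio test of a fitted lambda against lambda fixed to 0.

    Both arguments are log-likelihoods. Assumption: the statistic is
    compared with a chi-squared distribution with one degree of freedom.
    """
    stat = 2.0 * (loglik_fitted - loglik_null)
    stat = max(stat, 0.0)  # guard against tiny negative values from optimisation noise
    # Survival function of chi-squared with 1 d.f.: P(X > s) = erfc(sqrt(s / 2))
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


# Hypothetical log-likelihoods, for illustration only
stat, p = lambda_lrt(loglik_fitted=-1502.3, loglik_null=-1510.9)
print(f"LR statistic = {stat:.2f}, P = {p:.2g}")
```

A small P value here would indicate that the fitted λ describes the data better than a model with no phylogenetic signal.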

Effect of territoriality and sociability on mammal lethal aggression. To examine 
which factors explained the level of lethal aggression in mammals, we fitted
a phylogenetic generalized-least-squares (PGLS) model, with lethal aggression
(logit-transformed) as the dependent variable and territoriality and sociability as
independent variables. PGLS takes into account the phylogenetic signal in the
residuals of the model fitted to the data. To account for the intraspecific variability
in lethal aggression, for each of the 1,024 mammal species, we generated a normal 
distribution of lethal aggression values with their empirically observed means 
and standard errors. To control for potential biases produced by between-study
differences in sample size, the means and standard errors that were used to generate
the random distributions were first weighted by the number of individuals included 
in each study. We then ran the analysis 100 times, randomly sampling each time a
value from each of the 1,024 normal distributions. When a species was represented
by a single value, we used as its standard error the across-species average of
standard errors. The analyses were run with the help of the PGLS command in
the R package 'caper'.
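
The resampling scheme described in this section can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not the code used for the analyses: the function names, the input layout (one mean and one standard error per species, with None marking a species represented by a single value) and the example numbers are hypothetical. Each returned draw would then be passed to one PGLS fit.

```python
import random
import statistics


def weighted_mean(values, weights):
    """Mean of per-study values weighted by the number of individuals per study."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


def resample_species(species_stats, n_draws=100, seed=0):
    """Draw one value per species from a normal distribution, n_draws times.

    species_stats maps species name -> (mean, standard_error or None).
    Species without a standard error receive the across-species average
    of the known standard errors, as described in the text.
    """
    rng = random.Random(seed)
    known = [se for _, se in species_stats.values() if se is not None]
    fallback_se = statistics.mean(known)
    draws = []
    for _ in range(n_draws):
        draws.append({
            species: rng.gauss(mean, se if se is not None else fallback_se)
            for species, (mean, se) in species_stats.items()
        })
    return draws


# Hypothetical values, for illustration only
example = {"species_a": (1.0, 0.2), "species_b": (2.0, None)}
draws = resample_species(example, n_draws=100, seed=1)
print(len(draws), sorted(draws[0]))
```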

Phylogenetic estimation of human lethal violence. Phylogenetic trait estimation 
techniques were used to obtain the lethal violence level for H. sapiens as a 
function of its position in the mammal phylogeny. These techniques take advantage
of ancestral state estimation methods to predict traits of extant species.
The trait value of the focal species can be estimated as the ancestral node of the
tree rerooted at the most recent common ancestor of the focal species and the rest
of the tree. The trait value estimated with this ancestral estimation method is
the same as that provided by the intercept of a PGLS performed on the same tree. 
However, PGLS allows us to simultaneously include the level of the phylogenetic 
signal and other traits as covariates to improve the phylogenetic estimation of the 
study trait. Following this approach, we also estimated human lethal violence with
the help of a PGLS approach with territoriality and sociability as covariates and the 
phylogenetic information of the mammal tree rooted in the node where H. sapiens 
diverged from the rest of the mammals. The target species must be excluded from
the analysis to estimate the PGLS parameters. Four PGLS models were fitted to our
data: (i) without covariates and without H. neanderthalensis; (ii) with territoriality
and sociability as factorial covariates but without H. neanderthalensis; (iii) without
covariates and with H. neanderthalensis; and (iv) with territoriality and sociability
as factorial covariates and with H. neanderthalensis. In all models, the dependent
variable was logit-transformed and its variance was included using the approach 
explained in the previous section. 

Lethal aggression in main ancestral nodes of the human lineage. We estimated 
levels of lethal aggression in the most recent common ancestor of six important 
clades defining the course of the evolutionary history of humans: the class 
Mammalia, the infraclass Placentalia (placental mammals), the superorder 
Euarchontoglires or Supraprimates (primates, tree-shrews, colugos, rodents and 
hares), the grandorder Euarchonta (primates, colugos and tree-shrews), the order 
Primates (primates) and the superfamily Hominoidea (apes). Lethal aggression in 
these ancestral nodes was inferred using the same analytical approach as that used 
to estimate lethal violence in humans. 

Accuracy of the estimation of mammal lethal aggression from the PGLS. The 
accuracy of trait-estimation in a particular species increases with the level of 
phylogenetic signal of the study trait. To test for the accuracy of our models
under the observed phylogenetic signal, we used leave-one-out cross-validations 
with the whole mammalian data set in Supplementary Information section 9a. 
We inferred the level of lethal violence (logit-transformed) for each mammal 
species with the PGLS procedure and compared it with its actual value. We first 
examined the relationship between the estimated and observed lethal violence
values and subsequently calculated the proportion of species for which the actual
value fell inside the 95% confidence interval of the estimated trait (Supplementary
Information section 2). 
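
The coverage check described above (the proportion of species whose actual value falls inside the 95% confidence interval of the leave-one-out estimate) can be sketched as follows. This Python sketch is illustrative only. The function names are hypothetical, and the default predictor is a deliberately simple placeholder (the mean of the remaining species with a normal-approximation interval), not a PGLS; a real analysis would plug in a predictor that refits the phylogenetic model without the target species.

```python
import statistics


def mean_predictor(training_values):
    """Placeholder predictor (NOT a PGLS): mean with a normal 95% interval."""
    m = statistics.mean(training_values)
    se = statistics.stdev(training_values) / len(training_values) ** 0.5
    return m, m - 1.96 * se, m + 1.96 * se


def loo_coverage(values, predictor=mean_predictor):
    """Leave each value out in turn, predict it from the rest, and return
    the proportion of left-out values inside the predicted interval."""
    hits = 0
    for i, actual in enumerate(values):
        training = values[:i] + values[i + 1:]
        _, lower, upper = predictor(training)
        if lower <= actual <= upper:
            hits += 1
    return hits / len(values)


# Hypothetical logit-scale values, for illustration only
print(loo_coverage([-4.1, -3.8, -4.4, -3.9, -4.0, -2.5]))
```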

Effect of sampling effort on the estimation of human lethal violence. To check 
whether the estimates of conspecific-mediated human mortality were influenced 
by inappropriate or insufficient sampling, we repeated all analyses considering the 
subset of mammalian species with more than 50 observations (n= 645 mammals). 
We performed PGLS analysis to test whether territorial and social behaviour still
influence the level of lethal aggression (logit-transformed) for this subset of well-
sampled species. Afterwards, we calculated the conspecific-mediated human mortality
using this subset of well-sampled mammals (Supplementary Information section 4). 
Effect of phylogenetic depth on the estimation of human lethal violence. 
To check whether the estimates of conspecific-mediated human mortality
were influenced by the depth of the phylogeny, we repeated these analyses by
progressively including deeper nodes to obtain the estimate and the 95% confidence
intervals using the PGLS model without covariates. We considered the following
hierarchically nested clades, from shallower to deeper: Homininae, Hominidae,
Hominoidea, Catarrhini, Simiiformes, Haplorrhini, Primates, Primatomorpha,
Euarchonta, Euarchontoglires, Boreoeutheria, Eutheria, Theriiformes and
Mammalia. We are aware that moving from shallower to deeper nodes means
including an increasing number of species in the analyses (for example, we have 
only four Homininae but 1,022 Theriiformes in our phylogeny). To subsequently 
check whether the increasing number of species has any effect on the 95% 
confidence intervals, we repeated all analyses with random-pruned phylogenies 
equalling the number of species included in each of the clades described here 
(50 random phylogenies per clade) (Supplementary Information section 5). 
Effect of phylogeny size on the estimation of human lethal violence. To check
whether the estimates of conspecific-mediated human mortality were influenced
by the size of the phylogeny, we repeated these analyses with the progressive
inclusion of more species in the phylogenies. Specifically, we estimated human 
lethal violence and its 95% confidence interval in 50 randomly generated 
phylogenies with 100, 200, 300, 400, 500, 800, 900 and 1,000 spp., using the PGLS 
model without covariates. Afterwards, we contrasted these values with the level of 
human lethal violence obtained using the empirical phylogeny, checking whether 
smaller phylogenies departed from empirical results more strongly than larger 
phylogenies (Supplementary Information section 6). 

Statistical difference between phylogenetically estimated lethal violence in 
humans and ancestral nodes. We have checked whether the level of lethal violence 
phylogenetically inferred in humans is different from the lethal aggression inferred 
for the main ancestral nodes using t-tests. The phylogenetic estimates of both lethal 
violence in humans and lethal aggression in ancestral mammals were obtained by 
joining the 100 values obtained for each of the four PGLS models (with and without 
covariates and with and without H. neanderthalensis) and the two mammalian 
phylogenies used (Fritz et al. and Faurby and Svenning phylogenies).
We subsequently tested, by means of t-tests, whether these two distributions
differed. Because we repeated the same test six times (once per ancestral node),
we corrected all P values by means of sequential Bonferroni corrections.
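
The correction step can be sketched as follows, assuming that 'sequential Bonferroni' refers to Holm's step-down procedure (the text does not name the variant). The function name and the example P values are hypothetical.

```python
def holm_adjust(p_values):
    """Holm (sequential Bonferroni) adjusted P values, in the input order.

    The smallest P value is multiplied by m, the next by m - 1, and so on;
    adjusted values are capped at 1 and forced to be non-decreasing.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        candidate = min(1.0, (m - rank) * p_values[idx])
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


# Hypothetical raw P values for six tests (one per ancestral node)
print(holm_adjust([0.001, 0.020, 0.004, 0.300, 0.010, 0.050]))
```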
Statistical difference between observed and phylogenetically estimated lethal 
violence. For each temporal period and socio-political organization, we randomly 
sampled a given value of observed mortalities from a normal distribution with 
the same mean and standard error and compared it with a randomly sampled, 
phylogenetically estimated value. The phylogenetically estimated values were 
obtained by joining the 100 values obtained for each of the four PGLS models (with
and without covariates and with and without H. neanderthalensis) and the two
mammalian phylogenies (Fritz et al. and Faurby and Svenning phylogenies).
We repeated these paired comparisons 800 times, and recorded the proportion of 
times where the observed values were higher or lower than the phylogenetically 
estimated values. We subsequently tested, by means of binomial tests, whether this 
proportion differed from the randomly expected deviation. We ran each binomial
test 1,000 times and retained the average P values and deviance from the expected 
value. All P values shown underwent sequential Bonferroni correction. 
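
The paired-comparison procedure can be sketched as follows. This Python sketch is illustrative and rests on assumptions that the text does not state: the randomly expected proportion is taken to be 0.5, the binomial test is an exact two-sided test, and the function names and example numbers are hypothetical. Repeating the call with different seeds and averaging the P values would mirror the 1,000 repetitions described above.

```python
import math
import random


def binomial_two_sided_p(k, n, p=0.5):
    """Exact two-sided binomial P value: total probability of all outcomes
    that are no more likely than the observed count k."""
    pmf = [math.comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    threshold = pmf[k] * (1 + 1e-7)  # small tolerance for floating-point ties
    return min(1.0, sum(x for x in pmf if x <= threshold))


def compare_observed_with_estimates(obs_mean, obs_se, estimates, n_pairs=800, seed=0):
    """Pair a random observed value with a random phylogenetic estimate
    n_pairs times; return the proportion of pairs in which the observed
    value is higher, and the binomial P value against 0.5 (assumed)."""
    rng = random.Random(seed)
    higher = sum(
        rng.gauss(obs_mean, obs_se) > rng.choice(estimates)
        for _ in range(n_pairs)
    )
    return higher / n_pairs, binomial_two_sided_p(higher, n_pairs)


# Hypothetical inputs, for illustration only
proportion, p_value = compare_observed_with_estimates(3.0, 1.0, [1.8, 2.0, 2.1, 2.3])
print(proportion, p_value)
```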
Extended Data Table 1 | Outcome of the phylogenetic generalized linear model testing the effect of territoriality and social behaviour
on the magnitude of lethal aggression in mammal species (n = 1,024 species)

                                      Estimate ± s.e.m.   t-value   p-value

Fritz et al.'s phylogeny
  Territoriality                      0.54 ± 0.50         3.80      0.001
  Social behaviour                    0.47 ± 0.51         2.71      0.014
  Territoriality * Social behaviour   0.48 ± 0.55         1.33      0.244
  lambda of the model                 0.54                          0.0001

Faurby & Svenning's phylogeny
  Territoriality                      0.53 ± 0.51         3.01      0.001
  Social behaviour                    0.47 ± 0.51         2.70      0.009
  Territoriality * Social behaviour   0.48 ± 0.51         0.43      0.341
  lambda of the model                 0.88                          0.0001


22 and 100 mammalian phylogenies provided by Faurby and Svenning™. In this latter. 
transformed before all analyses,

Extended Data Table 2 | Outcome of the t-tests assessing difference between the inferred value of lethal violence at each of the chosen
ancestral nodes in the mammalian phylogeny and the phylogenetic estimates of human lethal violence

Ancestral nodes                                            t-test    p-value   Significance
Class Mammalia (mammals)                                   -72.49    0.0001    YES
Infraclass Placentalia (placentals)                        -70.88    0.0001    YES
Superorder Euarchontoglires (primates, rodents, hares)     -50.50    0.0001    YES
Grandorder Euarchonta (primates, tree-shrews, colugos)      15.66    0.0001    YES
Order Primates (primates)                                   20.78    0.0001    YES
Superfamily Hominoidea (apes)                              -16.31    0.0001    YES

Extended Data Table 3 | Outcome of the binomial tests assessing difference between the observed lethal violence in human societies
and the inferred lethal violence according to the phylogenetic analysis

                          Difference between the observed and the      p-value    Significance
                          phylogenetically inferred lethal violence
Temporal periods
 Old World Chronology
  Paleolithic             +0.0%                                        0.522      NO
  Mesolithic              +4.6%                                        0.064      NO
  Neolithic               +5.4%                                        0.029      NO
  Bronze Age              +2.5%                                        0.272      NO
  Iron Age                +8.1%                                        0.002      YES
  Medieval Age            +7.7%                                        0.002      YES
 New World Chronology
  Archaic                 +4.5%                                        0.073      NO
  Formative               +6.2%                                        0.0001     YES
  Classic                 +16.5%                                       0.0001     YES
  Postclassic             +13.0%                                       0.0001     YES
 The Entire World
  Modern Age              -37.1%                                       0.0001     YES
  Contemporary Age        -23.7%                                       0.0001     YES
Type of society
  Historic Band           +4.6%                                        0.131      NO
  Historic Tribe          +4.4%                                        0.116      NO
  Historic Chiefdom       +5.7%                                        0.009      YES
  Historic State          -42.9%                                       0.0001     YES
  Contemporary Band       +18.5%                                       0.00001    YES
  Contemporary Tribe      +12.2%                                       0.003      YES
  Contemporary State      -27.4%                                       0.00001    YES
We compared the observed lethal violence of each type of human society with the magnitudes of lethal violence obtained according to the four PGLS models (with and without covariates and with and without H. neanderthalensis) and the two mammalian phylogenies (Fritz et al. and Faurby and Svenning phylogenies). Each binomial test was run 1,000 times. Significance was assessed after sequential Bonferroni correction.

LETTER 


doi:10.1038/nature19792


Genomic analyses inform on migration events 
during the peopling of Eurasia

High-coverage whole-genome sequence studies have so far focused
on a limited number of geographically restricted populations,
or been targeted at specific diseases, such as cancer. Nevertheless,
the availability of high-resolution genomic data has led to the
development of new methodologies for inferring population
history and refuelled the debate on the mutation rate in humans.
Here we present the Estonian Biocentre Human Genome Diversity 
Panel (EGDP), a dataset of 483 high-coverage human genomes 
from 148 populations worldwide, including 379 new genomes from 
125 populations, which we group into diversity and selection 
sets. We analyse this dataset to refine estimates of continent-wide 
patterns of heterozygosity, long- and short-distance gene flow, archaic 
admixture, and changes in effective population size through time as 
well as for signals of positive or balancing selection. We find a genetic
signature in present-day Papuans that suggests that at least 2% of 
their genome originates from an early and largely extinct expansion 
of anatomically modern humans (AMHs) out of Africa. Together 
with evidence from the western Asian fossil record, and admixture
between AMHs and Neanderthals predating the main Eurasian
expansion, our results contribute to the mounting evidence for
the presence of AMHs out of Africa earlier than 75,000 years ago. 

The paths taken by AMHs out of Africa (OoA) have been the subject
of considerable debate over the past two decades. Fossil and archaeological
evidence, and craniometric studies of African and Asian
populations, demonstrate that Homo sapiens was present outside of
Africa ~120-70 thousand years ago (kya). However, this colonization
has been viewed as a failed expansion OoA since genetic analyses of
living populations have been consistent with a single OoA followed by
serial founder events.

Ancient DNA (aDNA) sequencing studies have found support 
for admixture between early Eurasians and at least two archaic
human lineages, and suggest modern humans reached Eurasia
at around 100 kya. In addition, aDNA from modern humans
suggests population structuring and turnover, but little additional
archaic admixture, in Eurasia over the last 35-45 thousand years.
Overall, these findings indicate that the majority of human genetic
diversity outside Africa derives from a single dispersal event that was
followed by admixture with archaic humans.

We used ADMIXTURE to analyse the genetic structure in our 
diversity set (Extended Data Figs 1, 2; Supplementary Information 
1.1-7). We further compared the individual-level haplotype similarity 
of our samples using fineSTRUCTURE (Extended Data Fig. 3). Despite 
small sample sizes, we inferred 106 genetically distinct populations 
forming 12 major regional clusters, corresponding well to the 148 
self-identified population labels. This clustering forms the basis for 
the groupings used in the scans of natural selection. Similar genetic 
affinities are highlighted by plotting the outgroup f3 statistic in the
form f3(X, Y; Yoruba), which here measures shared drift between a
non-African population X and any modern or ancient population Y
from Yoruba as an African outgroup (Supplementary Information 2.2.6,
Extended Data Fig. 4). 
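
The outgroup f3 statistic used above can be sketched as follows. This is a minimal Python illustration of the basic definition, the average over sites of (x - o)(y - o), where x, y and o are the allele frequencies in populations X, Y and the outgroup. The function name and the example frequencies are hypothetical, and the sketch omits the finite-sample bias correction and any normalisation applied by standard population-genetics software.

```python
def outgroup_f3(freq_x, freq_y, freq_out):
    """Basic outgroup f3(X, Y; O): mean over sites of (x - o) * (y - o).

    Each argument is a sequence of allele frequencies in [0, 1], one per
    site, in the same site order. Larger values indicate more genetic
    drift shared by X and Y since their split from the outgroup.
    """
    if not freq_x or not (len(freq_x) == len(freq_y) == len(freq_out)):
        raise ValueError("frequency lists must be non-empty and of equal length")
    total = sum((x - o) * (y - o) for x, y, o in zip(freq_x, freq_y, freq_out))
    return total / len(freq_x)


# Hypothetical allele frequencies at four sites, for illustration only
x = [0.9, 0.2, 0.5, 0.7]
y = [0.8, 0.1, 0.5, 0.6]
outgroup = [0.1, 0.6, 0.5, 0.3]
print(outgroup_f3(x, y, outgroup))
```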

Our sampling allowed us to consider geographic features correlated 
with gene flow by spatially interpolating genetic similarity measures
between pairs of populations (Supplementary Information 2.2.2).
We considered several measures and report gradients of allele frequencies
in Fig. 1, which were compared to gene flow patterns from
EEMS as a validation (Extended Data Fig. 5). Controlling for pairwise
geographic distance, we find a correlation between these genetic
gradients and geographic and climatic features such as precipitation
and elevation (inset of Fig. 1, Supplementary Information 2.2.2).

We screened for evidence of selection by first focusing on loci that 
showed the highest allelic differentiation among groups (Supplementary 
Information 3). We then performed positive and purifying selection 
scans (Methods), and found some candidate loci that replicate 
previously known and functionally supported findings (Supplementary 
Table 1:3.3.4-I, Supplementary Information 3.1, Extended Data Fig. 6;
Supplementary Table 1:3.1-IV,VI). Additionally, we infer more
purifying selection in Africans in genes involved in pigmentation
(bootstrapping p value (bpv) for RX/Y scores < 0.05) (Extended Data
Fig. 6) and immune response against viruses (bpv < 0.05), while further
purifying selection was indicated on olfactory receptor genes in Asians
(bpv < 0.05) (Supplementary Table 1:3.1.1-II). Our scans for ancient
balancing selection found a significant enrichment (FDR < 0.01) of
antigen processing/presentation, antigen binding, and MHC and
membrane component genes (Supplementary Information 3.2 and 3.3,
Supplementary Table 1:3.3.2-I-III). The HLA (HLA-C)-associated gene
(BTNL2) was the highest-scoring candidate in 8 of 12 geographic
regions for the HKA test (Supplementary Table 1:3.3.1-I). Our positive
selection scans, variant-based analyses (Supplementary Information 3.2
and 3.3) and gene enrichment studies also suggest new candidate loci
(Supplementary Information 3.4 and 3.5, Supplementary Table 1:3.5-I-VI),
a subset of which is highlighted in Supplementary Table 1:3-I.

Using fineSTRUCTURE, we find in the genomes of Papuans and 
Philippine Negritos more short haplotypes assigned as African than 
seen in genomes for individuals from other non-African popula- 
tions (Extended Data Fig. 7). This pattern remains after correcting 
for potential confounders such as phasing errors and sampling bias 
(Supplementary Information 2.2.1). These shorter shared haplotypes
would be consistent with an older population split. Indeed,
the Papuan-Yoruban median genetic split time (using multiple 
sequential Markovian coalescent (MSMC)) of 90 kya predates 
the split of all mainland Eurasian populations from Yorubans at 
~75 kya (Supplementary Table 1:2.2.3-I, Extended Data Fig. 4, Fig. 2a). 
This result is robust to phasing artefacts (Extended Data Fig. 8, see 
Methods). Furthermore, the Papuan-Eurasian MSMC split time of 
~40 kya is only slightly older than splits between west Eurasian and
East Asian populations dated at ~30 kya (Extended Data Fig. 4). The
Papuan split times from Yoruba and Eurasia are therefore incompatible 
with a simple bifurcating population tree model. 

At least two main models could explain our estimates of older 
divergence dates for Sahul populations from Africa than mainland 
Eurasians in our sample: 1) admixture in Sahul with a potentially 
un-sampled archaic human population that split from modern humans 
either before or at the same time as did Denisova and Neanderthal; or 2) 
admixture in Sahul with a modern human population (extinct OoA 
line; xOoA) that left Africa after the split between modern humans 
Figure 1 | Genetic barriers across space. Spatial visualization of genetic
barriers inferred from genome-wide genetic distances, quantified as the 
magnitude of the gradient of spatially interpolated allele frequencies 
(value denoted by colour bar; grey areas have been land during the last 
glacial maximum but are currently underwater). Here we used a spatial 
Kernel smoothing method based on the matrix of pairwise average 
heterozygosity and a MATLAB script that plots the hexagons of the grid 
with a colour coding to represent gradients. Inset, partial correlation


and Neanderthals, but before the main expansion of modern humans 
in Eurasia (main OoA). 

We consider support for these two non-mutually exclusive scenarios. 
Because the introgressing lineage has not been observed with aDNA, 
standard methods are limited in their ability to distinguish between 
these hypotheses. Furthermore, we show (Supplementary Information 
2.2.7) that single-site statistics, such as Patterson's D and sharing of
non-African alleles (nAAs), are inherently affected by confounding
effects owing to archaic introgression in non-African populations.
Our approach therefore relies on multiple lines of evidence using 
haplotype-based MSMC and fineSTRUCTURE comparisons (which 
we show should have power at this timescale; Supplementary
Information 2.2.13). 

We located and masked putatively introgressed Denisova haplotypes
from the genomes of Papuans, and evaluated phasing errors by
symmetrically phasing Papuan and Eurasian genomes (Methods).
Neither modification (Fig. 2a, Supplementary Information 2.2.9, 
Supplementary Table 1:2.2.9-I) changed the estimated split time (based
on MSMC) between Africans and Papuans (Methods, Supplementary
Information 2.2.8, Extended Data Fig. 8, Supplementary Table 1:2.2.8-I).
MSMC dates behave approximately linearly under admixture 
(Extended Data Fig. 8), implying that the hypothesized lineage 
may have split from most Africans around 120 kya (Supplementary 
Information 2.2.4 and 2.2.8).

We compared the effect on the MSMC split times of an xOoA or 
a Denisova lineage in Papuans by extensive coalescent simulations 
(Supplementary Information 2.2.8). We could not simulate the large 
Papuan-African and Papuan—Eurasian split times inferred from 
the data, unless assuming an implausibly large contribution from a 
Denisova-like population. Furthermore, while the observed shift in 
the African-Papuan MSMC split curve can be qualitatively reproduced 
between magnitude of genetic gradients and combinations of different
geographic factors, elevation (E), temperature (T) and precipitation (R),
for genetic gradients from fineSTRUCTURE (red) and allele frequencies 
(blue). This analysis (Supplementary Information 2.2.2 for details) shows 
that genetic differences within this region display some correlation with 
physical barriers such as mountain ranges, deserts, forests, and open 
water (such as the Wallace line). 


when including a 4% genomic component that diverged 120 kya from 
the main human lineage within Papuans, a similar quantity of Denisova 
admixture does not produce any significant effect (Extended Data 
Fig. 8). This favours a small presence of xOoA lineages rather than 
Denisova admixture alone as the likely cause of the observed deep 
African-Papuan split. We also show (Methods) that such a scenario is
compatible with the observed mitochondrial DNA and Y chromosome
lineages in Oceania, as also previously argued.

We further tested our hypothesized xOoA model by analysing 
haplotypes in the genomes of Papuans that show African ancestry not 
found in other Eurasian populations. We re-ran fineSTRUCTURE 
adding the Denisova, Altai Neanderthal and the Human Ancestral 
Genome sequences to a subset of the diversity set. FineSTRUCTURE
infers haplotypes that have a most recent common ancestor (MRCA) 
with another individual. Papuan haplotypes assigned as African had, 
regardless, an elevated level of non-African derived alleles (that is, 
nAAs fixed ancestral in Africans) compared to such haplotypes in 
Eurasians. They therefore have an older mean coalescence time with 
our African samples. 

Owing to the deep divergence between the sampled Denisova and the 
one introgressed into modern humans, it is possible that some archaic
haplotypes have a MRCA with an African instead of Denisova and are
assigned as ‘African’. We can resolve the coalescence time, and hence
origin, of these haplotypes by their sequence similarity with modern
Africans. To account for the archaic introgression we modelled these
genomic segments as a mixture of haplotypes assigned a) as African
or b) as Denisova in Eurasians and c) haplotypes assigned as Denisova
in Papuans. These haplotypes are modelled (see Methods, Extended 
Data Fig. 9) in terms of the distribution of length and mutation rate 
measured as a density of non-African derived alleles. Since Eurasians 
(specifically Europeans) have not experienced Denisova admixture, 
Figure 2 | Evidence of an xOoA signature in the genomes of modern
Papuans. a, MSMC split times plot. The Yoruba—Eurasia split curve shows 
the mean of all Eurasian genomes against one Yoruba genome. The grey 
area represents top and bottom 5% of runs. We chose a Koinanbe genome 
as representative of the Sahul populations. b-d, Decomposition of Papuan 
haplotypes inferred as African by fineSTRUCTURE. b, Semi-parametric
decomposition of the joint distribution of haplotype lengths and 
non-African derived allele rate per SNP, showing the relative proportion 
of haplotypes in K=20 components of the distribution, ordered by 
non-African derived allele rate, relative to the overall proportion of 


this approach disentangles lineages that coalesce before the human/ 
Denisova split from those that coalesce after. 

We found that the xOoA signature (Fig. 2b-d; Supplementary 
Information 2.2.10) was necessary to account for the number of short 
haplotypes with ‘moderate’ nAAs density in the data (that is, proportion
of non-African-derived sites higher than that of Eurasian haplotypes 
assigned as African but significantly lower than that of those assigned 
Denisova in either Eurasians or Papuans). Consistent with our MSMC 
findings (Supplementary Information 2.2.4), xOoA haplotypes have 
an estimated MRCA 1.5 times older than the Eurasian haplotypes in 
Papuan genomes, while the Denisovan haplotypes in Papuans are four 
times older than the Eurasian haplotypes. Adding up the contributions 
across the genome (Methods) leads to a genome-wide estimate of 1.9% 
xOoA (95% confidence interval 1.5-3.3) in Papuans, which we view 
as a lower bound.

Our results consistently point towards a contribution from a modern 
human source for derived alleles that are found in the genome
sequence of Papuans but not in Africans. Possible confounders could 
involve a shorter generation time in Papuan and Philippine Negrito 
populations, different recombination processes, or alternative demographic
histories that have not been investigated here. We therefore
strongly encourage the development of new model-based approaches 
that can investigate further the haplotype patterns described here. 

In conclusion, our results suggest that while the genomes of modern 
Papuans derive primarily from the main expansion of modern humans 
out of Africa, we estimate that at least 2% of their genome sequence 
reflects an earlier, otherwise extinct, dispersal (Extended Data Fig. 10). 

The inferred date of the xOoA split time (~120 kya) is consistent
with fossil and archaeological evidence for an early expansion of
H. sapiens from Africa. Furthermore, the recently identified
modern human admixture into the Altai Neanderthal before 100 kya
is consistent with a modern human presence outside Africa well 
haplotypes in each component. The four datasets produced by considering
haplotypes inferred as (African/Denisova) in (Europeans/Papuans)
are shown with our inferred ‘extra Out-of-Africa’ (xOoA) component.
AFR, African; DEN, Denisova; PNG, Papuans; EUR, Europeans. c, The
properties of the components in terms of non-African derived allele rate, 
on which the components are ordered, and length. d, The reconstruction 
of haplotypes inferred as African in the genomes of Papuan individuals, 
using a mixture of all other data (red) and with the addition of the xOoA 
signature (black). 


before the main OoA split time (~75 kya). Further studies will confirm
whether the Papuan genetic signature reported here and the
one observed in Altai Neanderthals reflect the same xOoA human 
group, as well as clarify the timing and route followed during such an 
early expansion. The high similarity between Papuans and the Altai 
Neanderthal reported in Extended Data Fig. 1 may indeed reflect a
shared xOoA component. Further studies are needed to explore this 
model and suggest that understanding human evolutionary history 
will require the recovery of DNA from additional fossils, and further 
archaeological investigations in under-explored geographical regions. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper. 
METHODS

No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during
experiments and outcome assessment. 

Data preparation. We analyse a set of genomes sequenced by the same technology 
(Complete Genomics Inc.) which results in minimal platform differences between 
batches of samples analysed by slight modifications of CG proprietary pipeline 
(Extended Data Fig. 2; Supplementary Information 1.6). Informed consent forms 
and REC approvals were obtained for all samples newly collected for this study. 
We see good concordance between CG sequence and Illumina genotyping array 
results for the same samples with minor reference bias in the latter data (Extended
Data Fig. 2; Supplementary Information 1.6). In the final dataset, we retained only 
one second-degree (Australians, to make use of all the available samples) and five
third-degree relative pairs (Supplementary Table 1:1.7-I). All genomes were
annotated against the Ensembl GRCh37 database and compared to dbSNP Human
Build 141 and Phase 1 of the 1000 Genomes Project dataset (Supplementary
Information 1.11.6). We found 10,212,117 new SNPs, 401,911 of which were
exonic. As expected from our sampling scheme, existing lists of variable sites have
been extended mostly by the Siberian, Southeast Asian and South Asian genomes,
which contribute 89,836 (22.4%), 63,964 (15.9%) and 40,758 (10.1%) of the new
exonic variants detected in this study.

Compared to the genome-wide average, we see fewer heterozygous sites on
chromosomes 1 and 2, and an excess on chromosomes 16, 19 and 21 (Extended
Data Fig. 2). This pattern is independent of simple potential confounders, such
as rough estimates of recombination activity and gene density (Supplementary
Information 1.8), and mirrors the inter-chromosomal differences in divergence
from chimpanzee, suggesting large-scale differences in mutation rates among
chromosomes. We confirmed this general pattern using 1000 Genomes Project
data (Supplementary Information 1.8).

The ‘ancient genome diversity panel’ consisted of 106 samples from the main
Diversity panel along with Altai Neanderthal, Denisova and the Modern Human 
reference genome. Sites that are heterozygous in archaic humans were removed. 
Geographic gradient analyses. We used a Gaussian kernel smoothing (based on 
the shortest distance on land to each sample) to interpolate genetic patterns across 
space. Averaging over all markers, we obtained an expression for the mean square
gradient of allele frequencies in terms of the matrix of genetic distance between
pairs of samples (Supplementary Information 2.2.2). This provides a simple way
to identify spatial regions that contribute strongly to genetic differences between
samples, and can be used, in principle, for any measure of genetic difference
(for fineSTRUCTURE data, we used negative shared haplotype length as a measure
of differentiation).
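
A minimal sketch of why a pairwise distance matrix suffices is given below. It assumes (these are illustrative assumptions, not the published implementation) straight-line rather than over-land distances, normalised Gaussian weights w_i(x), and a distance D_ij equal to the mean over markers of the squared allele-frequency difference. Because the weight gradients sum to zero, the mean square gradient of the interpolated frequencies equals -1/2 Σ_ij (∇w_i · ∇w_j) D_ij. All function names are hypothetical.

```python
import math
import random


def weight_gradients(point, sites, bandwidth):
    """Gradients of the normalised Gaussian kernel weights w_i at `point`."""
    kernels, kernel_grads = [], []
    for sx, sy in sites:
        dx, dy = point[0] - sx, point[1] - sy
        k = math.exp(-(dx * dx + dy * dy) / (2.0 * bandwidth ** 2))
        kernels.append(k)
        kernel_grads.append((-dx * k / bandwidth ** 2, -dy * k / bandwidth ** 2))
    total = sum(kernels)
    total_gx = sum(g[0] for g in kernel_grads)
    total_gy = sum(g[1] for g in kernel_grads)
    # Quotient rule for w_i = k_i / sum_j k_j.
    return [
        ((gx - k / total * total_gx) / total, (gy - k / total * total_gy) / total)
        for k, (gx, gy) in zip(kernels, kernel_grads)
    ]


def squared_distance_matrix(freqs):
    """Mean over markers of (f_i - f_j)^2 for every pair of samples."""
    n, m = len(freqs), len(freqs[0])
    return [[sum((freqs[i][k] - freqs[j][k]) ** 2 for k in range(m)) / m
             for j in range(n)] for i in range(n)]


def mean_square_gradient_direct(grads, freqs):
    """Average over markers of |grad f(x)|^2 using the raw frequencies."""
    m = len(freqs[0])
    total = 0.0
    for k in range(m):
        gx = sum(g[0] * f[k] for g, f in zip(grads, freqs))
        gy = sum(g[1] * f[k] for g, f in zip(grads, freqs))
        total += gx * gx + gy * gy
    return total / m


def mean_square_gradient_from_distances(grads, dist):
    """Same quantity computed from the pairwise distance matrix only."""
    n = len(grads)
    total = 0.0
    for i in range(n):
        for j in range(n):
            dot = grads[i][0] * grads[j][0] + grads[i][1] * grads[j][1]
            total += dot * dist[i][j]
    return -0.5 * total


# Synthetic sample locations and allele frequencies (not real data).
rng = random.Random(0)
sites = [(rng.uniform(0, 10), rng.uniform(0, 10)) for _ in range(6)]
freqs = [[rng.random() for _ in range(50)] for _ in sites]
grads = weight_gradients((5.0, 5.0), sites, bandwidth=3.0)
direct = mean_square_gradient_direct(grads, freqs)
via_distances = mean_square_gradient_from_distances(
    grads, squared_distance_matrix(freqs))
```

The two routes agree to floating-point precision, which is what allows any pairwise measure of differentiation to be substituted for D_ij.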

To quantify the link between the magnitude of genetic gradients (from
fineSTRUCTURE and allele frequency data) and geographic factors, we fitted a
generalized linear model to the sum of genetic magnitude gradients on the shortest
paths between samples to elevation, minimum quarterly temperature, and annual 
precipitation summed in the same way, controlling for path length and spatial 
random effects (Supplementary Information 2.2.2), and calculated partial 
correlations between genetic gradient magnitudes and geographic factors. 
FineSTRUCTURE analysis. FineSTRUCTURE was run as described in
Supplementary Information 2.2.1. Within the 106 genetically distinct
groups, labels were typically genetically homogeneous: 113 of the 148 population
labels (76%) were assigned to only one ‘genetic cluster’. Similarly, genetic clusters
were typically specific to a label, with 66 of the 106 ‘genetic clusters’ (62%)
containing only one population label.
Correction for phasing errors. To check whether phasing errors could produce 
the shorter Papuan haplotypes, we focused on regions of the genome that had 
an extended (>500 kb) run of homozygosity. We ran ChromoPainter for each
individual on only these regions, meaning each individual was only painted 
where it had been perfectly phased. This did not change the qualitative features 
(Supplementary Information 2.2.1). 
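
The idea behind this check can be sketched as follows: inside a run of homozygosity both haplotypes are identical, so no phasing decision can be wrong there. The function below is a simplified stand-in for a proper ROH caller (it is not PLINK's sliding-window algorithm); the input layout and the function name are assumptions made for illustration.

```python
def long_homozygous_runs(sites, min_length_bp=500_000):
    """Find heterozygote-free stretches longer than `min_length_bp`.

    `sites` is a list of (position_bp, is_heterozygous) tuples for one
    individual on one chromosome, sorted by position.
    Returns a list of (start_bp, end_bp) tuples.
    """
    runs = []
    run_start = run_end = None
    for position, is_het in sites:
        if is_het:
            # A heterozygous call closes the current run, if any.
            if run_start is not None and run_end - run_start > min_length_bp:
                runs.append((run_start, run_end))
            run_start = run_end = None
        else:
            if run_start is None:
                run_start = position
            run_end = position
    # Close a run that extends to the last site.
    if run_start is not None and run_end - run_start > min_length_bp:
        runs.append((run_start, run_end))
    return runs


# Toy genotype calls (hypothetical positions).
example_sites = [
    (100_000, False), (400_000, False), (900_000, False),
    (950_000, True),
    (1_000_000, False), (1_200_000, False),
]
print(long_homozygous_runs(example_sites))
```

Only the first stretch (800 kb) exceeds the threshold; the 200 kb stretch after the heterozygous site is discarded.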

Removal of similar samples. Papuans are genetically distinct from other 
populations due to tens of thousands of years of isolation. We wanted to check 
whether the length of haplotypes assigned as African was biased by the inclusion of 
a large number of relatively homogeneous Eurasians with few Papuans. To do this 
we repeated the n = 447 painting allowing only donors from dissimilar populations,
including only individuals who donated <2% of a genome in the main painting. 

This did not change the qualitative haplotype length features (Supplementary
Information 2.2.1).

Inclusion of ancient samples. We ran our smaller individual panel with 
(n = 109) and without (n = 106) ancient samples (Denisova, Neanderthal and
ancestral human). This did not change the qualitative haplotype length features
(Supplementary Information 2.2.1).
Selection analyses. We investigated balancing, positive and purifying selection
for a part of the dataset with larger group sizes which was defined as the Selection
subset (Supplementary Table 1:3.1-I and 3.2-I) using a wide range of window-based
as well as variant-based approaches. Furthermore, we investigated how these
signals relate to shared demographic history. Where possible we contextualized
our findings by integrating them with information from various functional
databases. Detailed descriptions of all methods used are available in Supplementary
Information section 3. 

MSMC, Denisova masking, simulations of alternative scenarios and assessment
of phasing robustness. Genetic split times were initially calculated following the
standard MSMC procedure, and subsequently modified as follows. To estimate
the effect of archaic admixture, putative Denisova haplotypes were identified
in Papuans using a previously published method and masked from all the
analysed genomes. In particular, whether a putative archaic haplotype was found
in heterozygous or homozygous state within the chosen Papuan genome, the
‘affected’ locus was inserted into the MSMC mask files and, hence, removed from
the analysis.

We note that a fraction of the Denisova and Neanderthal contributions to the
Papuan genomes may be indistinguishable, owing to the shared evolutionary
history of these two archaic populations. As a result, some of the removed
‘Denisova’ haplotypes may have actually entered the genome of Papuans through
Neanderthal. Regardless of this, our exercise shows that the MSMC
split time estimates are not affected by the documented presence of an archaic
genomic component (whether coming entirely from Denisova or partially shared
with Neanderthal).

We further excluded the role of Denisova admixture in explaining the deeper
African-Papuan MSMC split times through coalescent simulations (using ms to
generate 30 chromosomes of 5 Mbp each, and simulating each scenario 30 times).
These showed that the addition of 4% Denisova lineages to the Papuan genomes
does not change the MSMC results, while the addition of 4% xOoA lineages 
recreates the qualitative shift observed in the empirical data. 

Phasing artefacts were also taken into account as putative confounders of the
MSMC split time estimates. We re-ran MSMC after re-phasing one Estonian, one
Papuan and 20 West African and Pygmy genomes in a single experiment. This way
we ruled out potential artefacts stemming from the excess of Eurasian over Sahul
samples during the phasing process. Both the archaic and phasing corrections
yielded the same split time as the standard MSMC runs.
Emulation of all pairwise MSMC split times. We confirmed that none of the other
populations behaved as an outlier from those identified in the n = 22 full pairwise
analysis by estimating the MSMC split times between all pairs. We chose
9 representative populations (including Papuan, Yoruba and Baka) from the 22,
and compared each of the 447 diversity panel genomes to them. For each individual
$l$ not in our panel, we obtain the positive mixture weights $a_{li}$ using the model

$$Y_{lj} = \sum_{i} a_{li} Y_{ij} \quad \text{for } j \in (1..9)$$

The parameters are estimated using the $j \in (1..9)$ observations for which we have
data using a quadratic loss function. We can then predict the unobserved values.


Examination of this matrix (Supplementary Information 2.2.3, Supplementary
Table 1:2.2.3-II) implies no other populations are expected to have unusual MSMC
split times from Africa.
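
The emulation step described above can be sketched as a non-negative least-squares fit. The sketch below is an assumption-laden illustration rather than the published procedure: it uses plain projected gradient descent as the optimiser (the text specifies only positive weights and a quadratic loss), a toy four-population matrix in units of 100 kya, and hypothetical function names.

```python
def fit_positive_weights(reference, observed, n_iter=20000):
    """Minimise sum_j (observed[j] - sum_i a[i] * reference[i][j])^2, a[i] >= 0.

    reference[i][j]: split time between reference population i and panel
    population j; observed[j]: split time between the new individual and
    panel population j.
    """
    n_ref, n_panel = len(reference), len(observed)
    # Step 1/L, where L = 2 * ||reference||_F^2 bounds the gradient's
    # Lipschitz constant, so the loss never increases.
    step = 1.0 / (2.0 * sum(v * v for row in reference for v in row))
    weights = [0.0] * n_ref
    for _ in range(n_iter):
        residual = [
            sum(weights[i] * reference[i][j] for i in range(n_ref)) - observed[j]
            for j in range(n_panel)
        ]
        for i in range(n_ref):
            grad = 2.0 * sum(residual[j] * reference[i][j] for j in range(n_panel))
            # Gradient step followed by projection onto a[i] >= 0.
            weights[i] = max(0.0, weights[i] - step * grad)
    return weights


def predict_split_times(weights, reference_full):
    """Predict split times to every population from the fitted weights."""
    n_cols = len(reference_full[0])
    return [sum(w * row[j] for w, row in zip(weights, reference_full))
            for j in range(n_cols)]


# Toy full pairwise matrix (hypothetical values, units of 100 kya).
full_reference = [
    [0.00, 0.90, 0.75, 0.75],
    [0.90, 0.00, 0.90, 0.90],
    [0.75, 0.90, 0.00, 0.30],
    [0.75, 0.90, 0.30, 0.00],
]
panel_reference = [row[:3] for row in full_reference]  # first 3 = panel
observed = [0.75, 0.90, 0.15]  # new individual versus the 3 panel populations

weights = fit_positive_weights(panel_reference, observed)
predicted = predict_split_times(weights, full_reference)
```

The last entry of `predicted` is the emulated split time to the population that was not in the panel; with fewer observations than weights the fit is not unique, so the prediction should be read as one admissible interpolation.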

Mixture model for African haplotypes in Papuans. Obtaining haplotypes from
painting. We define African or Archaic haplotypes in Eurasians or Papuans as
genomic loci spanning at least 1,000 bp, and showing SNPs that were assigned
by ChromoPainter a >50% chance of copying from either an African or Archaic
genome, respectively. For each haplotype we then calculated the number of
non-African mutations, defined as sites found in derived state in a given haplotype
and in ancestral state in all of the African genomes included in the present study.
Modelling. We used a non-parametric model for the joint distribution of length
and non-African derived allele mutation rate in haplotypes. We fit K = 20
components to the joint distribution. Each component has a characteristic length
$\bar{l}_k$, variability $\sigma_k$ and mutation rate $\mu_k$. A haplotype of length $l_i$ with $X_i$ such
mutations from component $I_i = k$ has the following distribution:

$$l_i \mid \{\bar{l}_k, \sigma_k, I_i = k\} \sim \text{log-Normal}(\bar{l}_k, \sigma_k^2)$$

$$X_i \mid \{l_i, \mu_k, I_i = k\} \sim \text{Binomial}(l_i, \mu_k)$$
This model for haplotype lengths is motivated by the extreme age of the split
times we seek to model. Recent splits would lead to an exponential distribution
of haplotype lengths. However, owing to haplotype fixation caused by finite 
population size, very old splits have finite (non-zero) haplotype lengths. 
Additionally, the data are left-censored since we cannot reliably detect haplotypes 
that are very short. We note that while this makes a single component a reasonable 
fit to the data, as K increases the specific choice becomes less important. 

We then impose the prior $p(I_i = k) = 1/K$ and use the expectation-maximization
algorithm to estimate the mixture proportions $\pi_{ik} = E(I_i = k \mid l_i, X_i)$ along with the
maximum likelihood parameter estimates $\{\bar{l}_k, \sigma_k, \mu_k\}$. We do this for the four
combinations of haplotypes assigned as African (AFR) and Denisova (DEN) found
in Papuans (PNG) or Europeans (EUR), in order to learn the parameters.
Supplementary Information 2.2.10 describes this in more detail. We then describe
the distribution of haplotypes for each class $c$ of haplotype in terms of the expected
proportion of haplotypes found in each component,

$$\pi_c = (\pi_{c1}, \ldots, \pi_{cK}) \quad \text{where} \quad \pi_{ck} = \frac{1}{N_c} \sum_{i \in c} \pi_{ik},$$

where $N_c$ is the number of haplotypes of class $c$. $\pi_c$ is a vector of the proportions
from each of the K components.
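
The expectation step of this mixture can be sketched for a single haplotype as below. This is an illustrative sketch, not the authors' code: it assumes the first log-Normal parameter is the mean of the log length, uses a uniform 1/K prior, and shows two toy components rather than K = 20. The dictionary keys and function names are hypothetical.

```python
import math


def log_lognormal_pdf(length, log_mean, sigma):
    """Log density of a log-Normal with log-scale mean and s.d."""
    z = (math.log(length) - log_mean) / sigma
    return -math.log(length * sigma * math.sqrt(2.0 * math.pi)) - 0.5 * z * z


def log_binomial_pmf(x, n, p):
    """Log probability of x derived alleles among n sites at rate p."""
    return (math.lgamma(n + 1) - math.lgamma(x + 1) - math.lgamma(n - x + 1)
            + x * math.log(p) + (n - x) * math.log1p(-p))


def responsibilities(length, n_derived, components):
    """Posterior probability that the haplotype belongs to each component."""
    log_prior = math.log(1.0 / len(components))
    log_post = [
        log_prior
        + log_lognormal_pdf(length, c["log_mean"], c["sigma"])
        + log_binomial_pmf(n_derived, length, c["mu"])
        for c in components
    ]
    top = max(log_post)  # subtract the maximum for numerical stability
    unnormalised = [math.exp(v - top) for v in log_post]
    total = sum(unnormalised)
    return [v / total for v in unnormalised]


# Two toy components that differ only in derived-allele rate (hypothetical).
toy_components = [
    {"log_mean": math.log(5000), "sigma": 0.5, "mu": 1e-4},
    {"log_mean": math.log(5000), "sigma": 0.5, "mu": 1e-3},
]
# A 5,000 bp haplotype carrying 5 non-African derived alleles.
toy_posterior = responsibilities(5000, 5, toy_components)
```

Averaging such posteriors over all haplotypes of a class gives the class-level proportion vector used in the second layer of modelling.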

Single-out-of-Africa model. We fit haplotypes assigned as African in Papuans as a
mixture of the others in a second layer of mixture modelling:

$$\hat{\pi}_{\mathrm{PNG.AFR}} = \sum_{c \in \{\mathrm{PNG.DEN},\, \mathrm{EUR.AFR},\, \mathrm{EUR.DEN}\}} \alpha_c \pi_c$$

where the $\alpha_c$ sum to 1. This is straightforward to fit.
xOoA model. We jointly estimate an additional component $\pi_{\mathrm{xOoA}}$ and the mixture
contributions $\beta_c$ under the mixture

$$\hat{\pi}_{\mathrm{PNG.AFR}} = \sum_{c \in \{\mathrm{PNG.DEN},\, \mathrm{EUR.AFR},\, \mathrm{EUR.DEN},\, \mathrm{xOoA}\}} \beta_c \pi_c$$

This is non-trivial to fit. We use a penalization scheme to simultaneously ensure
we a) obtain a valid mixture for $\beta_c$; b) give a prediction $\hat{\pi}_{\mathrm{xOoA}}$ that is also a valid
mixture; c) leave little signal in the residuals; and d) obtain a good fit. Cross-validation
is used to obtain the optimal penalization parameters (A and B) with
the loss function:

$$\text{loss} = \sum_{k=1}^{K} e_k^2 + A P_1 + B P_2,$$

where $e_k$ are the residuals in each component, $P_1$ penalizes departures from
a valid mixture and $P_2 = \mathrm{s.d.}(e_k)$ (for requirement c, good solutions will have
similar residuals across components). The loss is minimized via standard
optimization techniques. Supplementary Information 2.2.10 details how initial
values are found and explores the robustness of the solution to changes in A and
B; the results do not change qualitatively for reasonable choices of these
parameters, and the mixtures are valid to within numerical error.

Genome-wide xOoA estimation. We used the estimated xOoA derived allele
mutation rate estimate $\hat{\theta}_{\mathrm{xOoA}}$ to estimate the xOoA contribution in haplotypes
classed as Eurasian or Papuan by ChromoPainter. First we obtained estimates of
$\pi_{\mathrm{PNG.EUR}}$ and $\pi_{\mathrm{PNG.PNG}}$ using the single out-of-Africa model above, additionally
allowing for a EUR.EUR contribution. We then estimate $\alpha_{\mathrm{xOoA}}$ using the observed
mutation rate $\theta_{\mathrm{obs}}$ and that predicted under the mixture model $\theta_{\mathrm{mix}}$ by rearranging
the mixture:

$$\theta_{\mathrm{obs}} = \alpha_{\mathrm{xOoA}} \theta_{\mathrm{xOoA}} + (1 - \alpha_{\mathrm{xOoA}}) \theta_{\mathrm{mix}}$$

Estimates less than 0 are set to 0. The genome-wide estimate is obtained by
weighting each $\alpha$ by the proportion of the genome that was painted with that donor.
Neanderthal and Denisova haplotypes were assumed to be proxied by PNG.DEN
(0% xOoA by assumption); African haplotypes by PNG.AFR; Papuan and
Australian by PNG.PNG and all other haplotypes by PNG.EUR. We obtain
confidence intervals by bootstrap resampling of haplotypes for each donor/
recipient pair.
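
The rearrangement and weighting described above amount to a few lines of arithmetic, sketched here under stated assumptions. The per-class fractions for haplotypes assigned as Papuan (4%) and Eurasian (0.1%) are the values reported in this section; the mutation rates and the painted genome shares are hypothetical placeholders, so the printed total is illustrative only and is not the published genome-wide estimate.

```python
def xooa_fraction(theta_obs, theta_mix, theta_xooa):
    """Solve theta_obs = a * theta_xooa + (1 - a) * theta_mix for a.

    Negative estimates are set to 0, as described in the text.
    """
    alpha = (theta_obs - theta_mix) / (theta_xooa - theta_mix)
    return max(0.0, alpha)


def genome_wide_fraction(alpha_by_class, painted_share_by_class):
    """Weight each class-level fraction by its share of the painted genome."""
    return sum(alpha_by_class[c] * painted_share_by_class[c]
               for c in alpha_by_class)


# Hypothetical rates (arbitrary units) to show the rearrangement.
example_alpha = xooa_fraction(theta_obs=1.2, theta_mix=1.0, theta_xooa=2.0)

# Class-level fractions: PNG.PNG and PNG.EUR from the text, PNG.DEN is 0
# by assumption. The painted shares below are invented for illustration.
alpha_by_class = {"PNG.PNG": 0.04, "PNG.EUR": 0.001, "PNG.DEN": 0.0}
painted_share = {"PNG.PNG": 0.45, "PNG.EUR": 0.50, "PNG.DEN": 0.05}
print(genome_wide_fraction(alpha_by_class, painted_share))
```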

We estimate the proportion of xOoA in Papuan haplotypes assigned as both
Eurasian (0.1%, 95% CI 0-2.6) and Papuan (4%, 95% CI 2.9-4.5) (Supplementary
Information 2.2.10), by using the estimated mutation density in xOoA.
Y chromosome and mtDNA haplogroup analysis. The presence of an extinct
xOoA trace in the genome of modern Papuans may seem at odds with analyses
of mtDNA and Y chromosome phylogenies, which point to a single, recent origin
for all non-African lineages (mtDNA L3, which gives rise to all mtDNA lineages
outside Africa, has been dated at ~70,000 years old). However, uniparental
markers inform on a small fraction of our genetic history, and a single origin for all
non-African lineages does not exclude multiple waves OoA from a shared common
ancestor. We show analytically (Supplementary Information 2.2.12) that, if the
xOoA signature entered the genome of Papuan individuals >40 kya, their mtDNA
and Y lineages could have been lost by genetic drift even assuming an initial xOoA
mixing component of up to 35%. Similar findings have been reported recently.
Extended Data Figure 1 | Sample Diversity and Archaic signals.

a, Map of location of samples highlighting the diversity/selection sets. 

b, Sample-level heterozygosity is plotted against distance from Addis 
Ababa. The trend line represents only non-African samples. The inset 
shows the waypoints used to arrive at the distance in kilometres for each 
sample. c, ADMIXTURE plot (K = 8 and 14) which relates general visual 
inspection of genetic structure to studied populations and their region of 

origin. d, Box plots were used to visualize the Denisova (red), Altai (green) 
and Croatian Neanderthal (blue) D distribution for each regional group 

of samples. Oceanian Altai D values show a remarkable similarity with the 
Denisova D values for the same region, in contrast with the other groups of 
samples where the Altai box plots tend to be more similar to the Croatian 
Neanderthal ones. Boxes show median, first and third quartiles, with 1.5×
interquartile range whiskers and black dots as outliers.
Extended Data Figure 2 | Data quality checks and heterozygosity
patterns. a, b, Concordance of DNA sequencing (Complete Genomics
Inc.) and DNA genotyping (Illumina genotyping arrays) data (ref-ref;
het-ref-alt and hom-alt-alt, see Supplementary Information 1.6) from chip
(a) and sequence data (b). c, Coverage (depth) distribution of variable
positions, divided by DNA source (blood or saliva) and complete genomic
calling pipeline (release version). d, Genome-wide distribution
of transition/transversion ratio subdivided by DNA source (saliva or
blood) and by complete genomic calling pipeline. e, Genome-wide
distribution of transition/transversion ratio subdivided by chromosomes.
f, Inter-chromosome differences in observed heterozygosity in
447 samples from the diversity set. g, Inter-chromosome differences
in observed heterozygosity in a set of 50 unpublished genomes from
the Estonian Genome Center, sequenced on an Illumina platform at an
average coverage exceeding 30x. h, Inter-chromosome differences in
observed heterozygosity in the phase 3 of the 1000 Genomes Project.
The total number of observed heterozygous sites was divided by the
number of accessible base pairs reported by the 1000 Genomes Project.


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved 


Extended Data Figure 3 | FineSTRUCTURE shared ancestry analysis. 
ChromoPainter and FineSTRUCTURE results, showing both inferred 
populations and the underlying (averaged) number of haplotypes that an 
individual in a population receives (rows) from donor individuals in other 
populations (columns). 108 populations are inferred by FineSTRUCTURE. 
The dendrogram shows the inferred relationship between populations.
The numbers on the dendrogram give the proportion of MCMC iterations
for which each population split is observed (where this is less than 1). 
Each ‘geographical region’ has a unique colour from which individuals
are labelled. The number of individuals in each population is given in
the label; for example, ‘4Italians; 3Albanians’ is a population of size 7
containing 4 individuals from Italy and 3 from Albania.
Extended Data Figure 4 | MSMC genetic split times and outgroup $f_3$
results. a, The MSMC split times estimated between each sample and a
reference panel of nine genomes were linearly interpolated to infer the
broader square matrix. b, c, Summary of outgroup $f_3$ statistics for each
pair of non-African populations or an ancient sample using Yoruba as
an outgroup. Populations are grouped by geographic region and are
ordered with increasing distance from Africa (left to right for columns
and bottom to top for rows). Colour bars at the left and top of the heat
map indicate the colour coding used for the geographical region.
Individual population labels are indicated at the right and bottom of the
heat map. The $f_3$ statistics are scaled to lie between 0 and 1, with a black
colour indicating those close to 0 and a red colour indicating those close
to 1. Let $m$ and $M$ be the minimum and maximum $f_3$ values within a
given row (that is, focal population). That is, for focal population X
(on rows), $m = \min_{Y, Y \neq X} f_3(X, Y; \text{Yoruba})$ and $M = \max_{Y, Y \neq X} f_3(X, Y; \text{Yoruba})$.
The scaled $f_3$ statistic for a given cell in that row is given by
$f_{3,\text{scaled}} = (f_3 - m)/(M - m)$, so that the smallest $f_3$ in the row has value
$f_{3,\text{scaled}} = 0$ (black) and the largest has value $f_{3,\text{scaled}} = 1$ (red). By default, the
diagonal has value $f_{3,\text{scaled}} = 1$ (red). The heat map is therefore asymmetric,
with the population closest to the focal population at a given row having
value $f_{3,\text{scaled}} = 1$ (red colour) and the population farthest from the
focal population at a given row having value $f_{3,\text{scaled}} = 0$ (black colour).
Therefore, at a given row, scanning the columns of the heat map reveals
the populations with the most shared ancestry with the focal population
of that row in the heat map.
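
The row-wise scaling defined in this caption can be written as a short sketch. The function name and the toy values are hypothetical; the diagonal is set to 1 as the caption specifies, and the minimum and maximum exclude the focal population itself.

```python
def scale_f3_row(f3_row, focal_index):
    """Min-max scale one row of outgroup f3 values to the range 0..1.

    The focal population's own cell (the diagonal) is excluded from the
    minimum and maximum and is assigned the value 1.
    """
    others = [v for i, v in enumerate(f3_row) if i != focal_index]
    lowest, highest = min(others), max(others)
    return [
        1.0 if i == focal_index else (v - lowest) / (highest - lowest)
        for i, v in enumerate(f3_row)
    ]


# Toy row for focal population 0 (the diagonal entry is a placeholder).
toy_row = [0.0, 0.25, 0.5, 0.375]
print(scale_f3_row(toy_row, focal_index=0))  # [1.0, 0.0, 1.0, 0.5]
```

Because each row is scaled by its own minimum and maximum, cell (X, Y) generally differs from cell (Y, X), which is why the heat map is asymmetric.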
Extended Data Figure 5 | Geographical patterns of genetic diversity.
Isolation by distance pattern across areas of high genetic gradient, using
Europe as a baseline. The samples used in each analysis are indicated by
coloured lines on the maps to the right of each plot. a-d, The panels show
$F_{ST}$ as a function of distance across the Himalayas (a), the Ural
mountains (b), and the Caucasus (c) as reported on the colour-coded map
(d). e, Effect of creating gaps in the samples in Europe. f, g, We tested the 
effect of removing samples from stripes, either north to south (f) or west 
to east (g), to create gaps comparable in size to the gaps in samples in the 
dataset. h, Effective migration surfaces inferred by EEMS. 
Extended Data Figure 6 | Summary of positive selection results. a, Bar
plot comparing frequency distributions of functional variants in Africans 
and non-Africans. The distribution of exonic SNPs according to their 
functional impact (synonymous, missense and nonsense) as a function 
of allele frequency. Note that the data from both groups was normalized 
for a sample size of n = 21 and that the Africans show significantly
(2 P< 1x 107) more rare variants across all sites classes. b, Result of 
1,000 bootstrap replicates of the $R_{X/Y}$ test for a subset of pigmentation genes
highlighted by Genome Wide Association Studies (GWAS, n= 32). The 
horizontal line provides the African reference (x= 1) against which all 
other groups are compared. The blue and red marks show the 95th and
the 5th percentile of the bootstrap distributions respectively. If the 95th
percentile is below 1, then the population shows significant excess of
missense variants in the pigmentation subset relative to the Africans.
Note that this is the case for all non-Africans except the Oceanians.
c, Pools of individuals for selection scans. A fineSTRUCTURE-based
co-ancestry matrix was used to define twelve groups of populations
for the downstream selection scans. These groups are highlighted in the
plot by boxes with broken line edges. The number of individuals in each
group is reported in Supplementary Table 1:3.2-I.
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Extended Data Figure 7 | Length of haplotypes assigned as African by 
fineSTRUCTURE as a function of genome proportion. a, 447 Diversity 
Panel results, showing label averages (large crosses) along with individuals 
(small dots). b, Relative excluded Diversity Panel results, to check for 
whether including related individuals affects African genome fraction. 
Individuals that shared more than 2% of genome fraction were forbidden 
from receiving haplotypes from each other, and the painting was re-run on 
a large subset of the genome (all run of homozygosity (ROH) regions from 
any individual). c, ROH-only African haplotypes. To guard against phasing
errors, we analysed only regions for which an individual was in a long
(>500 kb) run of homozygosity using the PLINK command
“--homozyg-window-kb 500000 --homozyg-window-het 0 --homozyg-density 10”.
Because there are so few such regions, we report only the
population average for populations with two or more individuals, as well
as the standard error in that estimate. Populations for which the 95%
confidence interval passed 0 were also excluded. Note the logarithmic axis.
d, Ancient DNA panel results. We used a different panel of 109 individuals
which included three ancient genomes. We painted chromosomes 11, 21 
and 22 and report as crosses the population averages for populations with 
two or more individuals. The solid thin lines represent the position of each 
population when modern samples only are analysed. The dashed lines 
lead off the figure to the position of the ancient hominins and the African 
samples.
Extended Data Figure 8 | MSMC Linear behaviour of MSMC split
estimates in presence of admixture. a-c, The examined Central Asian (a), 
East African (b), and African-American (c) genomes yielded a signature 
of MSMC split time (truth, left-most column) that could be recapitulated 
(reconstruction, second left-most column) as a linear mixture of other 
MSMC split times. The admixture proportions inferred by our method
(top of each admixture component column) were remarkably similar to 
the ones previously reported from the literature. d, MSMC split times 
calculated after re-phasing an Estonian and a Papuan (Koinanbe) genome 
together with all the available West African and Pygmy genomes from 
our dataset to minimize putative phasing artefacts. The cross coalescence 
rate curves reported here are quantitatively comparable with the ones
of Fig. 2a, hence showing that phasing artefacts are unlikely to explain
the observed past-ward shift of the Papuan–African split time. e, Box
plot showing the distribution of differences between African–Papuan
and African–Eurasian split times obtained from coalescent simulations
assembled through random replacement to make 2,000 sets of
6 individuals (to match the 6 Papuans available from our empirical
dataset), each made of 1.5 Gb of sequence. The simulation command line
used to generate each chromosome made of 5 Mb was as follows, where
x is the variable for the divergence time used. x = 0.064, 0.4 or 0.8 for
the xOoA, Denisova (Den) and Divergent Denisova (DeepDen) cases,
respectively: ms0ancient2 10 1 .065 .05 -t 5000. -r 3000. 5000000 -I 7 1 1 1
1 2 2 2 -en 0 1 .2 -en 0 2 .2 -en 0 3 .2 -en 0 4 .2 -es .025 7 .96 -en .025 8 .2
-ej .03 7 6 -ej .04 6 5 -ej .060 8 3 -ej .061 4 3 -ej .062 2 1 -ej .063 3 1 -ej x 1 5.
Extended Data Figure 9 | Modelling the xOoA components with
FineSTRUCTURE. a, Joint distribution of haplotype lengths and
derived allele count, showing the median position of each cluster and all
haplotypes assigned to it in the maximum a posteriori (MAP) estimate.
Note that although a different proportion of points is assigned to each in
the MAP, the total posterior is very close to 1/K for all. The dashed lines
show a constant mutation rate. Haplotypes are ordered by mutation rate
from low to high. b, Residual distribution comparison between the
two-component mixture using EUR.AFR and EUR.PNG (left), and the
three-component mixture including xOoA (using the same colour scale)
(right). The root mean square error (RMSE) residuals without xOoA
are larger (RMSE = 0.0055 compared to RMSE = 0.0018) but more
importantly, they are also structured. c, Assuming a mutational clock and
a correct assignment of haplotypes, we can estimate the relative age of
the splits from the number of derived alleles observed on the haplotypes.
This leads to an estimate of 1.5 times older for xOoA compared to the
Eurasian–Africa split.
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Extended Data Figure 10 | Proposed xOoA model. A schematic 
illustrating, as suggested by the results presented here, a model of an
early, extinct Out-of-Africa (xOoA) signature in the genomes of Sahul
populations at their arrival in the region. Given the overall small genomic
contribution of this event to the genomes of modern Sahul individuals,
we could not determine whether the documented Denisova admixture
(question marks) and putative multiple Neanderthal admixtures took place
along this extinct OoA. We also speculate (question mark) that people who
migrated along the xOoA route may have left a trace in the genomes of the
Altai Neanderthal as reported by Kuhlwilm and colleagues.
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Advances in genome assembly and phasing provide an opportunity 
to investigate the diploid architecture of the human genome and 
reveal the full range of structural variation across population 
groups. Here we report the de novo assembly and haplotype phasing 
of the Korean individual AK1 (ref. 1) using single-molecule real-time
sequencing, next-generation mapping, microfluidics-based
linked reads, and bacterial artificial chromosome (BAC)
sequencing approaches. Single-molecule sequencing coupled with 
next-generation mapping generated a highly contiguous assembly, 
with a contig N50 size of 17.9Mb and a scaffold N50 size of 44.8Mb, 
resolving 8 chromosomal arms into single scaffolds. The de novo 
assembly, along with local assemblies and spanning long reads, 
closes 105 and extends into 72 out of 190 euchromatic gaps in 
the reference genome, adding 1.03 Mb of previously intractable 
sequence. High concordance between the assembly and paired-end 
sequences from 62,758 BAC clones provides strong support for the 
robustness of the assembly. We identify 18,210 structural variants 
by direct comparison of the assembly with the human reference, 
identifying thousands of breakpoints that, to our knowledge, have 
not been reported before. Many of the insertions are reflected in 
the transcriptome and are shared across the Asian population. We 
performed haplotype phasing of the assembly with short reads, 
long reads and linked reads from whole-genome sequencing and 
with short reads from 31,719 BAC clones, thereby achieving phased 
blocks with an N50 size of 11.6Mb. Haplotigs assembled from single- 
molecule real-time reads assigned to haplotypes on phased blocks 
covered 89% of genes. The haplotigs accurately characterized the 
hypervariable major histocompatibility complex region as well as
demonstrating allele configuration in clinically relevant genes such 
as CYP2D6. This work presents the most contiguous diploid human 
genome assembly so far, with extensive investigation of unreported 
and Asian-specific structural variants, and high-quality haplotyping 
of clinically relevant alleles for precision medicine. 

Although massively parallel sequencing approaches have been
widely used to study genomic variation, simple alignment of short
reads to a reference genome cannot be used to investigate the full
range of structural variation and phased diploid architecture, which
are important for precision medicine. By contrast, the single-molecule
real-time (SMRT) sequencing platform produces long reads that can
resolve repetitive structures effectively. We integrated this technology
with several other sequencing approaches to construct a high-quality
Korean diploid genome assembly (Extended Data Fig. 1).

Jong-Yeon Shin & Changhoon Kim

SMRT sequencing of the genome of a Korean individual AK1, for
whom we have previously reported the annotated variations assessed
with BAC clones and array comparative genomic hybridization, was
performed at 101× coverage using Pacific Biosciences (PacBio) RSII
(Extended Data Fig. 2a). Reads were assembled and error-corrected
with FALCON and Quiver to generate 3,128 contigs with a contig
N50 length of 17.9 Mb (Extended Data Table 1, Extended Data Fig. 2b
and Supplementary Tables 1-3). To anchor these contigs into larger
scaffolds, we used next-generation mapping (NGM) from BioNano
Genomics Irys System, which produces physical maps with unique
sequence motifs that can provide long-range structural information
of the genome. Two rounds of NGM at 97× and 108× coverage were
performed, with the second designed to protect fragments better from
breakage at fragile sites, providing improved long-range anchoring
(Supplementary Table 4). The optical maps were assembled de novo
into genome maps. Hybrid scaffolding of the contigs and genome maps
resulted in 2,832 scaffolds with a scaffold N50 size of 44.8 Mb (Extended
Data Table 1 and Extended Data Fig. 3a). Because NGMs provide
orders of magnitude longer range information (Supplementary Table 4)
compared to long reads from the SMRT platform (Supplementary Table 1),
we relied on the genome map when there were conflicts between the
two datasets. Checks for consistency between genome maps and contigs
corrected potential assembly errors within 23 contigs (Extended Data
Fig. 3b and Supplementary Table 5). The final assembly after polishing
with Illumina reads (Extended Data Fig. 4a) is characterized by marked
contiguity that has not been achieved by non-reference assemblies of
the human diploid genome so far, and improves on the previous best
N50 length by 18 Mb (Table 1). The largest 91 scaffolds, for example,
cover 90% of the genome and 8 chromosomal arms are spanned by
single scaffolds (Fig. 1).
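
The contig and scaffold N50 values quoted above follow the usual definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly. A minimal sketch of that calculation is given below; the function name `n50_l50` and the toy lengths are illustrative assumptions, not part of the AK1 pipeline.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a collection of contig or scaffold lengths.

    N50 is the length of the sequence at which the running total, taken
    from longest to shortest, first reaches half of the assembly size.
    L50 is the number of sequences needed to get there.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise ValueError("no lengths supplied")


# Toy example (hypothetical lengths, arbitrary units).
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50_l50(toy_lengths))
```

The same function applies to contigs, scaffolds or phased blocks; only the input list changes.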

The scaffolding accuracy of the AK1 assembly was assessed using
paired-end sequences from the AK1 BAC library, from 62,758 BAC clones
(Extended Data Fig. 1). Most (95.4%) of the uniquely aligned BAC
clones were in concordance with the assembly (Extended Data Table 2),
as expected since the genomic DNA originated from the same
individual. From the set of BAC clones that aligned concordantly with
the reference genome, 99.8% also aligned concordantly with the AK1
assembly, with most of the discrepancies caused by phase differences
(Supplementary Table 6). The base accuracy of the assembly was
assessed by Illumina short reads (72×). The read-depth distributions
of the reads mapped to GRCh37, GRCh38 and AK1 show similar
patterns (Extended Data Fig. 4b). The estimated base-level error rate of
the assembly was less than 10° based on the count of single nucleotide 
polymorphisms (SNPs) with unexpected alleles (Extended Data Fig. 4c
and Supplementary Table 7). 

We used the AK1 assembly to close gaps remaining in human
genome reference GRCh38. Of 190 euchromatic gaps (Supplementary
Table 8), 65 were closed entirely by our de novo assembly (Fig. 1b and
Extended Data Fig. 5). Local realignment and reassembly, and use
of spanning reads, resolved a further 40 gaps. The closed gaps were
filled with a total of 364 kb of sequence into 1.5 Mb (Supplementary
Table 9 and Supplementary Information). We also extended into
72 of the 85 remaining gaps with the addition of 663 kb of sequence into
4.1 Mb. These locations, previously intractable using only short reads,
commonly contained simple tandem repeats, as reported previously.
One example (Fig. 1c, d) illustrates two gaps resolved by AK1 assembly
with supporting evidence from BACs and genome maps.

Figure 1 | AK1 de novo assembly scaffolds compared to GRCh38.
a, Scaffold coverage over GRCh38 per chromosome. The blue shading
represents scaffold size, with darker segments for longer scaffolds. Eight
chromosomal arms are spanned by single scaffolds. Closed euchromatic
gaps are labelled in red on each chromosome, with the total number of
gaps in grey. b, Number of gaps closed using the AK1 assembly (blue),
local assembly of long reads (light blue), and long reads alone (red).
The number of extended gaps with AK1 assembly is represented in yellow,
with long reads in green and open gaps in grey. The 65 dot plots of gaps
closed with the AK1 assembly can be found in the AK1 genome browser
(http://211.110.34.36/gbrowse2). c, AK1 assembly resolving two gaps
along with BACs and optical map suggests that gap_367 and both its edges
(red and black bars) shrink to zero, whereas gap_368 expands to 144 kb
(yellow bar). d, Three dot plots show how unique sequences have been
added to the reference genome. Reference–reference (top left), reference–
AK1 assembly (top right) and AK1–AK1 (bottom right). A and B indicate
deleted GRCh38 sequence around gap_367.

We identified 18,210 structural variants (SVs), including 7,358
deletions, 10,077 insertions, 71 inversions, and 704 complex variants
at a base resolution through the direct comparison between the AK1
assembly and the human reference genome GRCh37 (Supplementary
Tables 10, 11). We were able to validate 271 out of 276 SVs with BAC
contigs generated by SMRT sequencing (Supplementary Table 12).
Compared to previous studies, a total of 11,927 variants were
previously unreported, which account for approximately 47% (3,465)
and 76% (7,710) of all deletions and insertions, respectively (Fig. 2a and
Extended Data Fig. 6a). Of the new SVs, 86% were highly enriched for
clusters of mobile and tandem repeats (Extended Data Fig. 6b). PacBio
long-read sequencing of the corresponding transcriptome revealed that 
155 isoforms are expressed from 54 novel insertion loci, indicating the 
existence of functional elements in human genomes that were probably 
undetectable using short reads (Supplementary Table 13). A total of 
4,326 deletions and 5,833 insertions occurred within 6,073 genes. Out 
of 615 exonic variants, 427 were new, and 68% of them did not affect 
protein functionality by maintaining the reading frame or occurring 
within non-protein coding genes. Among the new amino-acid- 
changing variants, 77% were composed of mobile or tandem repeats 
(Supplementary Table 14), and functional annotation clustering with 
the 31 genes, which contain the remaining non-repetitive variants, 
using DAVID showed that they were predominantly related to ion
binding, epidermal growth factor, and fibronectin. 

Investigation of the insertions suggested that the AK1 sequences
consist not only of repeats and duplications, but also of unique
sequences that are not found in the reference genome. To examine
whether the unique sequences are universal or ancestry specific,
we aligned raw reads from high-coverage 1000 Genomes Project
samples and additional high-coverage Asian samples against our
AK1 assembly, and compared the normalized read depths between
four ancestral groups. Out of 853 insertions, encompassing 1.7 Mb,
which were found in all of the ancestral groups, 800 insertions were
also called from the variant analysis with respect to GRCh38, and
as such are candidates for addition to the human reference genome
(Supplementary Tables 15, 16). Moreover, 400 insertions showed highly
polymorphic frequency variability across the populations, and 76 of
them, including 45 genic insertions, were Asian specific. Among the
genic insertions, we found that a 592-bp insertion within POU2F3,
reported to have distinctly variable haplotype frequencies among
populations, was comprised of 452 bp of unique sequence between
two 140-bp duplications (Fig. 2b). We also identified numerous large
insertions with higher frequency in the Asian population, such as a
4,539-bp insertion in HRASLS2. Next, we investigated the haplotype
structures associated with Asian-specific variants by using linkage
disequilibrium blocks inferred from 1000 Genomes Project Asian
samples. Among the variants, 39 insertions were present within
the blocks, and 82% of them were located on the same block as a
homozygous AK1 SNP, the frequency of which was highest in the Asian
population (Supplementary Table 17). One of the insertions, found
within ANO2, had a similar allele frequency to adjacent homozygous
AK1 SNPs within the same linkage disequilibrium block, suggesting that
the insertion shares a single ancient haplotype with the SNPs (Fig. 2c).
Our findings demonstrate the important genomic differences of the Asian
ancestral group from the others, and highlight the need for further
genomic studies focused on individuals outside of European ancestry to
describe the full range of functionally important variations in humans.

Figure 2 | AK1 SV distribution and Asian-specific variants.
a, Distribution of insertions (red/orange) and deletions (cyan/dark blue)
between AK1 and GRCh37, compared to SVs identified from previous
studies. In total, 47% and 76% of the deletions and insertions, respectively,
were previously unreported. b, Allele frequency of 45 Asian-specific
insertions (>0.3 allele frequency difference; <0.5 non-Asian allele
frequency). The coverage for the genic insertions was calculated from
(EAS) linkage disequilibrium (LD) block, sharing a similar population
allele frequency with the adjacent AK1 SNPs. AFR, African; EUR,
European; SAS, South Asian.
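
The Figure 2 caption gives two thresholds for calling an insertion Asian specific: an allele frequency difference above 0.3 and a non-Asian allele frequency below 0.5. The sketch below shows one way such a filter could be written. It assumes, without confirmation from the text, that the difference is taken against the highest non-Asian frequency; the function name, the population labels used as dictionary keys and the example frequencies are all hypothetical.

```python
def is_asian_specific(freqs, asian="EAS", min_diff=0.3, max_other=0.5):
    """Flag an insertion as Asian specific from per-population allele frequencies.

    freqs maps a population label to the insertion allele frequency.
    Assumption: the frequency difference is measured against the highest
    non-Asian frequency, which must itself stay below max_other.
    """
    others = [f for pop, f in freqs.items() if pop != asian]
    highest_other = max(others)
    return (freqs[asian] - highest_other) > min_diff and highest_other < max_other


# Hypothetical frequencies for two insertions.
print(is_asian_specific({"EAS": 0.8, "AFR": 0.1, "EUR": 0.2, "SAS": 0.3}))
print(is_asian_specific({"EAS": 0.9, "AFR": 0.1, "EUR": 0.2, "SAS": 0.7}))
```

Comparing against the highest non-Asian frequency is the stricter of the plausible readings; a mean over non-Asian groups would admit more insertions.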

To reflect the diploid genome structure better, we built separate
de novo assemblies (haplotigs) representing the two haplotypes of
each homologous chromosome pair. Phasing was performed with
PacBio long reads, Illumina short reads, 10X Genomics linked reads
(30×), and reads from BACs representing a single haplotype (47×).
Heterozygous SNVs called from these methods are unambiguously
assigned to two alternative phases, producing phased blocks with
an N50 length of 11.6 Mb, considerably longer than previously
reported (Table 1). We assessed the accuracy of the phased
blocks against the end sequences of BACs, and found the long-range
switch error rate to be under 0.3%. SMRT reads were then partitioned
into the two phases in which sufficient marker SNVs were present.
The two partitioned read sets were assembled de novo into haplotigs
(Table 1 and Extended Data Table 3).
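
The partitioning step described above, in which reads are assigned to a phase when they carry enough marker SNVs, can be illustrated with a simple majority vote. This is a sketch under stated assumptions rather than the authors' implementation: the data layout (dictionaries keyed by position), the `min_markers` threshold and the tie handling are all hypothetical choices.

```python
from collections import Counter


def assign_read(read_alleles, phased_snvs, min_markers=2):
    """Assign a read to haplotype "A" or "B", or return None if undecided.

    read_alleles: {position: base observed on the read}
    phased_snvs:  {position: (base on haplotype A, base on haplotype B)}
    min_markers:  hypothetical minimum number of informative marker SNVs
    """
    votes = Counter()
    for pos, base in read_alleles.items():
        if pos not in phased_snvs:
            continue  # position is not a phased heterozygous marker
        base_a, base_b = phased_snvs[pos]
        if base == base_a:
            votes["A"] += 1
        elif base == base_b:
            votes["B"] += 1
    informative = votes["A"] + votes["B"]
    if informative < min_markers or votes["A"] == votes["B"]:
        return None  # too few markers, or no majority
    return "A" if votes["A"] > votes["B"] else "B"


# Hypothetical phased block with three marker SNVs.
markers = {100: ("A", "G"), 250: ("C", "T"), 400: ("G", "A")}
print(assign_read({100: "A", 250: "C", 400: "G"}, markers))
print(assign_read({100: "G", 250: "T"}, markers))
print(assign_read({100: "A"}, markers))
```

Reads left unassigned by such a rule would simply be excluded from both haplotype-specific read sets before assembly.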
Table 1 | Comparison of human de novo assembly and haplotype phasing summary statistics

 | AK1 | HuRef | YH2.0 | NA12878 | GRCh38
Assembly approach | WGS and BAC | WGS | WGS and fosmid | WGS | BAC and fosmid
Sequencing and physical mapping | PacBio and BioNano | Sanger | Illumina and CG | PacBio and BioNano | Sanger, FISH, OM and fingerprint contigs
De novo assembly algorithm | FALCON | Celera | SOAPdenovo2 | Celera and FALCON | Multiple methods
Phasing approach | De novo | Reference-guided | De novo | Reference-guided | NA
Scaffold/contig N50 (Mb) | 44.85/17.92 | 17.66/0.11 | 20.52/0.02 | 26.83/1.56 | 67.79/56.41
Scaffold/contig L50 | 21/50 | 48/7,164 | 39/40,005 | 37/532 | 16/19
No. of scaffolds/contigs | 2,832/4,206 | 4,530/71,333 | 125,643/361,157 | 18,903/21,235 | 735/1,385
No. of gaps | 264* | 68,109† | 235,514† | 2,332* | 999
Total gap length (Mb) | 37.34 | 34.43 | 105.20 | 146.35 | 159.97
Total bases/non-N bases in assembly (bp) | 2,904,207,288/2,866,687,809 | 2,844,000,504/2,809,571,127 | 2,911,235,363/2,806,031,133 | 3,176,574,379/3,030,222,093 | 3,209,286,105/3,049,316,098
Phased block N50 (Mb) | 11.55 | 0.35 | NA | 0.15 | NA
No. of haplotigs | 18,964 | NA | 24,597 | NA | NA
Haplotig N50 (kb) | 875 | NA | 484 | NA | NA
Haplotig sum (bp) | 4,804,460,182 | NA | 5,152,727,603 | NA | NA

We compared the sequencing platform, algorithms, assembly and phasing statistics of human assemblies so far. The comparison demonstrates the
power of single-molecule technologies to generate assemblies with superior assembly statistics than that achieved by short-read sequencing. The
assembly statistics were obtained from the NCBI and if the summary statistics were not available from NCBI, the numbers were directly acquired
from relevant papers. The accession numbers for HuRef, YH_2.0 (ref. 8), NA12878 (ref. 6) and GRCh38 assemblies are GCA_000002125.2,
GCA_000004845.2, GCA_001013985.1 and GCA_000001405.15, respectively. CG, complete genomics; FISH, fluorescent in-situ hybridization;
NA, not applicable; OM, optical mapping; WGS, whole-genome shotgun.
*Number of spanned gaps.
†Number of spanned and unspanned gaps.
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Figure 3 | Circular visualization of phased 
blocks with phase-specific expression and two 
phased regions of MHC class II and CYP2D6.
a, Genome-wide map of highly heterozygous
regions and expression levels of haplotype A and
B in log scale. b, HLA genes in the MHC class
II region. This highly variable, complex region
contained many SVs, making it difficult to phase
against the reference genome, but allowed full
resolution through the de novo approach. For
detailed comparison, see Extended Data Fig. 7.
c, Both haplotypes of CYP2D6 and CYP2D7.
A duplicated copy of CYP2D6 was fused with the
last exon of CYP2D7 on haplotype B.
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Comparison of the haplotigs to the human reference led to identi- 
fication of haplotype-specific alleles including SNPs, short indels and 
SVs (Supplementary Table 18). In addition to the SVs called from the 
assembly, 13,436 heterozygous haplotype-specific SVs were identi- 
fied from haplotigs. We tested the accuracy of these SVs against BAC 
contigs on the same phase, and found that 67 out of the 69 that could be 
assessed matched perfectly (Supplementary Table 19). The combined 
length of SNVs, indels and SVs that were heterozygous between the 
two haplotigs was 69.8 Mb. Moreover, we were able to measure the 
expression level from each haplotype genome widely (Fig. 3a). 

We examined the haplotypes of human leukocyte antigen (HLA)
genes in detail, and confirmed the haplotypes using targeted SMRT
sequencing (Supplementary Table 20). To avoid common problems
associated with hyperpolymorphic patterns of allelic variation,
major histocompatibility complex (MHC) class I and II regions
were assembled independently. The MHC class II region was phased
successfully despite a large number of SVs, highlighting the utility of
our de novo phasing approach (Fig. 3b and Extended Data Fig. 7). Our
approach also allowed a clinically important duplication of CYP2D6
to be detected and assigned to one phase (Fig. 3c). This result
demonstrates that de novo assembly-based phasing has advantages in resolving
challenging hypervariable regions, and could be used further for
pharmacogenomics (Supplementary Discussion).

Allelic configuration is also particularly important for recessive 
traits. For example, we were able to phase two genes that contained 
more than two nonsynonymous, heterozygous alleles known to be 
associated with recessive diseases (Supplementary Table 21). Variants 
in MEFV (ref. 18) and ADAMTS13 (ref. 19), which are predicted to cause
familial Mediterranean fever and Upshaw–Schulman syndrome under
the autosomal recessive inheritance pattern, respectively, were found 
in cis configuration, with the partner haplotype left intact. 

These results demonstrate the power of de novo genome assembly
and phasing by integrating SMRT sequencing, genome maps, linked 
reads and BACs for the generation of high-quality contiguous scaffolds, 
the detection of the full range of SVs, and for understanding the haplo- 
type structure in clinically relevant genes for precision medicine. 


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.
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METHODS 

No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and investigators were not blinded to allocation during 
experiments and outcome assessment. 

AK1 cell line. An immortalized lymphoblastoid cell line was established from
the AK1 individual through Epstein-Barr virus transformation of mononuclear
cells (Seoul Clinical Laboratories Inc.). Full pathogen testing was performed
and the line was maintained in a mycoplasma-free facility. The AK1 lymphoblastoid cell line was
cultured in RPMI 1640 media containing 15% FBS at 37°C in a humidified 5%
CO2 environment. The approval number C-0806-023-246 for the AK1 individual
was assigned based on the guidelines from the Institutional Review Board of Seoul
National University.

PacBio data generation. Genomic DNA was extracted from AK1 cells using the
Gentra Puregene Cell Kit (Qiagen). Large-insert PacBio library preparation was
conducted following the Pacific Biosciences recommended protocols. In brief,
a total of 60 μg AK1 genomic DNA was sheared to ~20 kb targeted size by using
Covaris g-TUBEs (Covaris). Each shearing processed 10 μg input DNA and a
total of 6 shearings were performed. The sheared genomic DNA was examined
by Agilent 2100 Bioanalyzer DNA12000 Chip (Agilent Technologies Inc.) for size
distribution and underwent DNA damage repair/end repair, blunt-end adaptor
ligation followed by exonuclease digestion. The purified digestion products were
loaded onto pre-cast 0.6% agarose for 7-50 kb size selection using the BluePippin
Size Selection System (Sage Science), and the recovered size-selected library
products were purified using 0.5× pre-washed Agencourt AMPure XP beads
(Beckman Coulter). The final libraries were examined by Agilent 2100 Bioanalyzer
DNA12000 Chip for size distribution and the library concentration was
determined with Qubit 2.0 Fluorometer (Life Technologies). We sequenced with the
PacBio RSII instrument with P6 polymerase binding and C4 chemistry kits (P6C4).
A total of 380 SMRT Cells were used to yield 101-fold whole-genome sequence
data.

Sample preparation for BioNano Genomics. AK1 cells were pelleted and washed
with PBS; the final cell pellet was re-suspended in cell-suspension buffer using the
CHEF Mammalian Genomic DNA Plug Kit (Bio-Rad). Cells were then embedded
in CleanCut low-melt Agarose (Bio-Rad) and spread into a thin layer on a custom
support (in development at BioNano Genomics). Cells were lysed using IrysPrep
Lysis Buffer (BioNano Genomics), protease-treated with Puregene Proteinase K
(Qiagen), followed by brief washing in Tris with 50 mM EDTA and then washing
in Tris with 1 mM EDTA before RNase treatment with Puregene RNase (Qiagen).
DNA was then equilibrated in Tris with 50 mM EDTA and incubated overnight at
4°C before extensive washing in Tris with 0.1 mM EDTA followed by equilibration
in NEBuffer 3 (New England BioLabs) at 1× concentration. Purified DNA
in the thin layer agarose was labelled following the IrysPrep Reagent Kit protocol
with adaptations for labelling in agarose. In brief, 1.25 μg of DNA was digested
with 0.7 U Nt.BspQI nicking endonuclease per microlitre of reaction volume in
NEBuffer 3 (New England BioLabs) for 130 min at 37°C, then washed with TE Low
EDTA Buffer (Affymetrix), pH 8.0, followed by equilibration with 1× ThermoPol
Reaction Buffer (New England BioLabs). Nick-digested DNA was then incubated
for 70 min at 50°C using the IrysPrep Labelling mix (BioNano Genomics) and Taq
DNA Polymerase (New England BioLabs) at a final concentration of 0.4 U μl⁻¹.
Nick-labelled DNA was incubated for 40 min at 37°C using the IrysPrep Repair
mix (BioNano Genomics) and Taq DNA Ligase (New England BioLabs) at a final
concentration of 1 U μl⁻¹. Labelled-repaired DNA was then recovered from the
thin layer agarose by digesting with GELase and counterstained with IrysPrep
DNA Stain (BioNano Genomics) before data collection on the Irys System. The
fragile site rescue process protects fragile sites by reducing the temperature of the
labelling reaction and minimizes shear forces by restraining DNA in agarose until
nicks are repaired. In this case, only the closest opposite-strand nick-pairs break.

Sequencing library preparation using the GemCode platform. Sample indexing
and partition barcoded libraries were prepared using GemCode Gel Bead and
Library Kit (10X Genomics). Sequencing was conducted with Illumina HiSeq2500
to generate linked reads.

Illumina data generation. Libraries were generated with PCR-free protocols.
gDNA was sheared twice using Covaris S2 with cycling conditions of 10% duty
cycle, Cycles/Burst 200, and Time 100 s. The sheared DNA was processed using the
Illumina TruSeq DNA PCR-Free LT Library Kit protocol to generate 550 bp inserts,
which includes end repair, SPRI bead size selection, A-tailing, and Y-adaptor
ligation. Library concentration was measured by qPCR and loaded on HiSeq X
Ten instruments (PE-150) to generate 72-fold sequence coverage.

DNA preparation from BAC clones. A total of 32,026 BAC clones were selected
from the 252 384-well plates and re-plated into 96-well plates. Clones were grown
overnight, and the cultures were used to prepare two additional replicates for the
two 384-well plates that were stored at −80°C in LB medium containing 20%
glycerol. A total of 32,026 clone cultures with growth at ODs ranging from 0.6 to
1.0 were pooled, pelleted and the DNA was extracted using the standard alkaline
lysis method. In this procedure, a cell pellet was resuspended in 150 μl of Qiagen
buffer P1 with RNase and lysed with 150 μl of 0.2 M NaOH, 1% SDS solution for
5 min. Lysis was neutralized with the addition of 150 μl of 3 M sodium acetate,
pH 4.8. Neutralized lysate was incubated on ice for 30 min, and DNA was collected
by centrifugation for 15 min at 15.7g at 4°C, concentrated by standard ethanol
precipitation and resuspended in 25 μl of 10 mM Tris-HCl, pH 8.5.

PacBio sequencing of BAC clones. DNA from approximately 150 BAC clones with
roughly equimolar concentration was combined into a single pool. A total of 10 μg
of DNA from each pool was sheared and fragments of insert size ranging from
10 to 15 kb were selected. Two libraries were prepared from the pooled DNA using
PacBio SMRTbell library preparation kit v1.0. The libraries were quantified using
a Qubit 2.0 fluorometer and each library was sequenced using two SMRT cells,
with P6C4 chemistry.

Illumina sequencing of BAC clones. DNA from approximately 290 BAC clones
with roughly equimolar concentration was combined into a single BAC pool.
One nanogram of DNA from each pool was digested and fragments of insert size
ranging from 500 to 550 bp were selected. In total, 109 libraries were prepared from
the pooled DNA using Illumina-compatible Nextera XT DNA sample prep kit and
sequenced with HiSeq2500.

Sample preparation for RNA sequencing. We extracted RNA from tissue using
RNAiso Plus (Takara Bio), followed by purification using RNeasy MinElute
(Qiagen). RNA was assessed for quality and was quantified using RNA 6000
Nano LabChip on a 2100 Bioanalyzer (Agilent). The RNA sequencing (RNA-seq)
libraries were prepared as previously described. RNA library was sequenced with
Illumina TruSeq SBS Kit v3 on a HiSeq 2000 sequencer (Illumina) to obtain 100 bp
paired-end reads. The image analysis and base calling were performed using the
Illumina pipeline with default settings.

Sample preparation for isoform sequencing. Total RNA extracted from AK1 
cells with RNA integrity number (RIN) >8.0 was used for library preparation. The 
library was constructed following the Clontech SMARTer-PCR cDNA Synthesis 
Sample Preparation Guide. 1-2 kb, 2-3 kb, 3-6 kb and >5 kb libraries were selected 
by SageELF, purified and end-repaired, and blunt-end SMRTbell adapters were ligated. 
The fragment size distribution was confirmed on a Bioanalyzer HS chip (Agilent) 
and quantified on a Qubit fluorometer (Life Technologies). The sequencing was 
carried out on the PacBio RS II instrument using P6C4. 

PacBio long-read de novo assembly. Around 31 million subreads were used for 
assembly with FALCON v0.3.0 (ref. 21) given length_cutoff parameter of 10kb for 
initial mapping to build pre-assembled reads (preads), and preads over 15kb were 
used (length_cutoff_pr) to maximize the assembled contig N50 (Extended Data 
Fig. 2). Primary and associated contigs were polished using Quiver. 
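
The contig N50 that the length cutoffs were tuned to maximize can be illustrated with a minimal sketch. The function name and the toy contig lengths below are illustrative assumptions, not values or code from the assembly itself.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    contain at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no contig lengths given")


# Hypothetical contig lengths in kb, for illustration only.
toy_contigs_kb = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs_kb))
```

Sorting once and accumulating from the longest contig keeps the calculation linear after the sort, which is enough for assemblies with a few thousand contigs.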

BioNano Genomics genome map generation. Optical maps were de novo 
assembled into genome maps using BioNano assembler software (Irys System, 
BioNano Genomics). Single molecules longer than 150 kb with at least 8 fluorescent 
labels were used to find possible overlaps (P < 1 x 10"). Next, these maps were 
constructed into consensus maps by recursively refining and extending them by 
mapping single molecules (P < 1 x 10~*). The consensus maps were compared 
and merged into genome maps when patterns matched (P < 1 x 10"). A second 
set of optical maps was obtained thereafter, and assembled into genome maps with 
the same criteria. 

Contig editing and hybrid assembly. Primary contigs were in silico digested into 
maps and were compared with genome maps for scaffolding. The scaffolding was 
visualized and performed with the Irys Viewer. When conflict occurred, the contigs 
were edited with the guidance of the genome map. 

Assembly improvements. Paired-end reads from the Illumina platform were aligned 
to the assembly using bwa mem, followed by duplicate removal using 
Picard tools. Base-pair correction of the assembly was performed using Pilon. 
Pilon mostly corrected single insertions and deletions in regions enriched with 
homopolymers. Contigs or scaffolds shorter than 10 kb were excluded from the 
overall analysis to avoid results from spurious misassembly. 

Scaffold accuracy measurement with BAC clones. Scaffolding accuracy of the 
AK1 assembly was assessed using the AK1 BAC library. AK1 BAC end sequences 
(BES) were aligned to GRCh37, GRCh38 and AK1 assemblies using BWA. The 
BES placements were categorized by the alignment, orientation and separation 
of BES with respect to the assembly. The BES placement was determined to be 
concordant: (1) if the paired end sequences were placed in the same assembly unit; 
(2) if the paired end sequences were properly oriented; and (3) if the in silico insert 
size was between 50,000 and 250,000 bp. If the BES placements did not meet these 
conditions, the BES placement was defined to be discordant. In addition, if only one 
of the paired end sequences was aligned to the assembly, the BES placement was 
defined to be an orphan placement. If both paired-end sequences were unaligned 
to the assembly, the BES was defined to be unmapped. If either of the paired-end 
sequences were aligned to different positions of the assembly multiple times, the 
BES was defined to have multiple placements. 
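
The placement categories above can be sketched as a small decision function. This is a hedged illustration, not the authors' pipeline: the `EndHit` record, the inward-facing definition of "properly oriented", and the use of the distance between leftmost alignment positions as the in silico insert size are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class EndHit:
    unit: str    # assembly unit (scaffold or chromosome) the end aligned to
    pos: int     # leftmost aligned position on that unit
    strand: str  # '+' or '-'


def classify_bes(hits_a: List[EndHit], hits_b: List[EndHit],
                 min_insert: int = 50_000, max_insert: int = 250_000) -> str:
    """Classify one BAC from the alignments of its two end sequences."""
    if not hits_a and not hits_b:
        return "unmapped"            # neither end aligned
    if not hits_a or not hits_b:
        return "orphan"              # only one end aligned
    if len(hits_a) > 1 or len(hits_b) > 1:
        return "multiple"            # an end aligned to several positions
    a, b = hits_a[0], hits_b[0]
    if a.unit != b.unit:
        return "discordant"          # ends on different assembly units
    left, right = (a, b) if a.pos <= b.pos else (b, a)
    # Assumption: proper orientation means the ends face each other.
    properly_oriented = left.strand == "+" and right.strand == "-"
    insert_size = right.pos - left.pos
    if properly_oriented and min_insert <= insert_size <= max_insert:
        return "concordant"
    return "discordant"


print(classify_bes([EndHit("scaffold_1", 1_000, "+")],
                   [EndHit("scaffold_1", 101_000, "-")]))
```

Checking the unmapped, orphan and multiple cases first mirrors the order of the definitions in the text, so each BAC falls into exactly one category.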

Gap closure and SV analysis: alignment to the reference genome. To identify the 
precise genomic location of each assembly unit, we used LASTZ with parameters 
(--gapped --gap=600,150 --hspthresh=4500 --seed=12of19 --notransition 
--ydrop=15000 --chain) to align each assembly unit to each chromosome in 
the human reference genome. A chaining procedure was followed to join the 
neighbouring local alignments into a single cohesive alignment. The chained 
alignments of each assembly unit were processed to obtain a single alignment with 
the best alignment score. If the selected alignment was not fully representative of 
the assembly unit, we selected a set of alignments that was better representative of 
the assembly unit. A netting procedure was then followed with the selected chained 
alignments. The chaining and netting procedures were applied using UCSC Kent 
tools and parallel processing was used when possible to increase computational 
speed. 

Gap closure of GRCh38. Gaps were classified into telomeric, centromeric, 
heterochromatic, acrocentric and euchromatic region according to the agp file and 
cytoband information provided by the Genome Reference Consortium (GRC) 
and UCSC genome browser. In total, 190 euchromatic gaps were targeted for gap 
closure with AK1 assembly. The gaps that could not be closed or extended with the 
AK1 assembly were subjected to closure through local assembly using Canu or a 
contiguous subread. Subreads mapped 10 kb upstream or downstream of the gap 
were chosen for local assembly. Alignment was performed with BLASR (-bestn 3), 
and primary aligned reads with mapping quality of 254 were used. The assembled 
contigs were thereafter aligned to their respective gap position to precisely identify 
the added sequences. Subreads used to close the gaps were chosen following criteria 
described in the Supplementary Information. 

Assembly based variant detection. The alignments of the assembly to the 
reference genome were parsed to obtain SNPs, indels and SVs, which we defined 
as insertion, deletion, inversion and complex variants with event size equal to 
or greater than 50 bp. The complex SVs are the same as the 'double-sided insertion' 
defined previously. We used GRCh37 instead of GRCh38 for the main analysis 
for compatibility and comparison with previously reported structural variations. 

SV annotation. Repeat elements were annotated using RepeatMasker (-species 
human -no_is) and Tandem Repeats Finder (TRF) (2 7 7 80 10 50 2000 -f -m -h -d). 
An SV was assigned to a repeat class if at least 70% of its sequence was masked by a single repeat type. 
Complex is defined as the SVs having either several annotated repeat elements, or at 
least 30% of the remaining sequence not annotated as repeat. Novelty was identified 
by comparing the breakpoints with a 50% reciprocal overlap criterion. Functional 
annotation was performed using both GENCODE release v19 (GRCh37) and v21 
(GRCh38) and the Ensembl Regulatory Build. For those SVs that occurred 
within gene regulatory domains, we annotated with the nearest gene name. 
SVs located within pericentromeric regions (5 Mb flanking annotated centromeres) 
and subtelomeric regions (150 kb from the annotated telomeric sequence) were 
annotated as heterochromatin. Both pilot and strict accessibility genome mask 
regions (version 20141020) were downloaded from 
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/supporting/accessible_genome_masks/. Segmental 
duplication sites were downloaded from the UCSC table browser. To simplify 
categorization of the SVs that lie within multiple functional regions, they were 
classified according to the following order of priority: coding sequence, untranslated 
region, intron, transcription-factor-binding site, promoter, enhancer, CTCF 
(transcriptional repressor), and intergenic. To annotate whether the SVs called 
from GRCh37 were also shared with GRCh38 SV sets, we compared the AK1 
breakpoints with a 50% reciprocal overlap criterion. In addition, we assessed whether 
the SVs called from GRCh38 were also represented in the alternative contigs by 
measuring the concordance against the SV regions including the surrounding 50 bp 
from the breakpoints. 
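
The 50% reciprocal overlap criterion used for novelty and for the GRCh37/GRCh38 comparison can be written as a short check. This is a minimal sketch assuming half-open coordinates on the same chromosome; the function name and the example intervals are illustrative.

```python
def reciprocal_overlap(a_start, a_end, b_start, b_end, min_fraction=0.5):
    """Return True if intervals [a_start, a_end) and [b_start, b_end)
    overlap by at least min_fraction of the length of EACH interval."""
    overlap = min(a_end, b_end) - max(a_start, b_start)
    if overlap <= 0:
        return False
    return (overlap >= min_fraction * (a_end - a_start)
            and overlap >= min_fraction * (b_end - b_start))


# Two 100-bp calls shifted by 50 bp share exactly half of each interval.
print(reciprocal_overlap(100, 200, 150, 250))
# A short call inside a much longer one fails the reciprocal test.
print(reciprocal_overlap(100, 200, 150, 400))
```

Requiring the fraction on both intervals prevents a small SV from being matched to a much larger known event that merely contains it.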

Asian-specific SVs. Population allele frequency of SVs was obtained by aligning 
reads from 38 high-coverage samples from five different ancestral backgrounds 
(African, American, European, East Asian, and South Asian) to the AK1 assembly. 
We obtained whole-genome sequencing data of 23 individuals from the 1000 
Genomes Project and we additionally sequenced 15 East Asian individuals 
(5 Japanese, 5 Chinese and 5 Koreans). Analysis candidates were selected from 
the insertions with less than 70% of repeats. We excluded any duplications among 
the insertions that are mapped to GRCh37 using BLAST (-evalue 1e-10 
-perc_identity 90 -qcov_hsp_perc 90). The regions that have been recognized as mobile 
element or tandem repeat by RepeatMasker and TRF software were masked 
for analysis. Normalized read depth within the unique sequence was obtained 
by dividing the read depth, which was calculated using samtools bedcov, by the 
median genome coverage. The insertions were determined to be highly polymorphic 
if there were greater than or equal to 0.3 variant frequency differences across 
the different populations. Asian-specific insertions were chosen by selecting the 
insertions with an allele frequency difference of 0.3 or more between Asian and 
non-Asian populations as well as a non-Asian allele frequency of 0.5 or less. 
Asian linkage disequilibrium blocks were obtained from East Asian samples 
in the 1000 Genomes Project phase 3 using the S-MIG++ algorithm (-maf 0.05 -ci 
AV -probability 0.95). Linkage disequilibrium blocks with below 0.8 haplotype 
diversity index were excluded. 
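
The depth normalization and the Asian-specific filter described above can be sketched as follows. Two points are assumptions for illustration: that the per-region value taken from samtools bedcov is the summed per-base depth (so it is divided by the region length first), and that the frequency difference is taken as Asian minus non-Asian. The function names and example numbers are hypothetical.

```python
def normalized_depth(summed_depth, region_length, median_genome_coverage):
    """Mean depth over a region divided by the sample's median coverage.

    summed_depth is assumed to be the sum of per-base depths in the region.
    """
    mean_depth = summed_depth / region_length
    return mean_depth / median_genome_coverage


def is_asian_specific(asian_af, non_asian_af, min_diff=0.3, max_non_asian=0.5):
    """Apply the two thresholds from the text to a pair of allele frequencies."""
    return (asian_af - non_asian_af) >= min_diff and non_asian_af <= max_non_asian


# A 100-bp unique segment covered at the sample median gives a ratio of 1.0.
print(normalized_depth(7200, 100, 72.0))
print(is_asian_specific(0.9, 0.2))
```

Normalizing by each sample's own median coverage makes depths comparable across samples sequenced to different depths before allele frequencies are estimated.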

De novo phasing markers. We performed phasing against the de novo assembly. 
SNPs and short indels called from whole-genome sequencing (72×) of short reads 
were phased with linked reads. The non-redundant set of PacBio subreads was 
aligned to the assembly, and corrections were applied by calculating the maximum 
likely variant allele for the phased variants based on the read depth. A phased block 
was defined as the region spanning two markers which had a subread or linked read 
information providing phasing. Similar to the linked reads, Illumina sequenced 
BAC phase information was used to correct phasing markers and extend phased 
blocks. Correction and other bioinformatics methods were performed using an 
in-house script, described in the Supplementary Information. 

Switch error of phasing markers. Long-range switch error measurements were 
obtained using BAC end sequences. The end sequences were aligned to the AK1 
assembly with bwa mem, and the base allele of the phasing marker site was called 
with the corresponding BAC information. When switching occurred for more 
than two marker sites in a phased block, it was defined as a long-range switch. The 
long-range switch error rate was calculated as: no. of long-range switches/no. of 
phasing markers. 
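
The long-range switch error rate can be sketched as below. The counting rule is an interpretation made for illustration, not the authors' script: each marker in a phased block is recorded as agreeing or disagreeing with the BAC-derived allele, and a run of more than two consecutive disagreeing markers is counted as one long-range switch.

```python
from itertools import groupby


def count_long_range_switches(agrees, min_run=3):
    """Count runs of at least min_run consecutive disagreeing markers.

    agrees is a list of booleans, one per phasing marker in a block,
    ordered by position (True = marker agrees with the BAC allele).
    """
    return sum(
        1
        for agree, run in groupby(agrees)
        if not agree and len(list(run)) >= min_run
    )


def long_range_switch_rate(n_switches, n_markers):
    """No. of long-range switches divided by no. of phasing markers."""
    if n_markers <= 0:
        raise ValueError("need at least one phasing marker")
    return n_switches / n_markers


# Hypothetical block: one run of three disagreeing markers.
block = [True, True, False, False, False, True]
switches = count_long_range_switches(block)
print(switches, long_range_switch_rate(switches, len(block)))
```

Isolated one- or two-marker disagreements are ignored under this rule, which separates long-range switches from point errors at single sites.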

Haplotig assembly. Using the final set of phasing markers, subreads were classified 
into sets of haplotype A or B when >85% of the phasing markers agreed. When a 
subread contained no marker, it was classified as homozygous. Through the read 
depth, phasing markers that were missed in previous steps were additionally called 
for homozygous regions adjacent to phased blocks. Subreads in haplotype A or 
homozygous regions were assembled into haplotig A, and haplotype B into haplotig 
B with Canu. Haplotigs for MHC class I and II were assembled separately to avoid 
misassemblies owing to high sequence homology between HLA genes. In this case, 
subreads phased as homozygous were used with subreads of haplotype A and B. 
Homozygously phased subreads flanked on each side of a sequencing gap belonged 
on haplotype A and B, respectively, and were re-classified during assembly. 
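
The subread classification rule at the start of this procedure can be sketched as follows. The input representation (one 'A' or 'B' call per phasing marker covered by the subread) and the "unassigned" label for subreads that reach neither threshold are assumptions for illustration; the text does not state how such subreads were handled.

```python
def classify_subread(marker_calls, min_agreement=0.85):
    """Assign a subread to haplotype A, B, homozygous or unassigned.

    marker_calls holds one 'A' or 'B' per phasing marker on the subread.
    """
    if not marker_calls:
        return "homozygous"          # no phasing marker on the subread
    n = len(marker_calls)
    if marker_calls.count("A") / n > min_agreement:
        return "A"
    if marker_calls.count("B") / n > min_agreement:
        return "B"
    return "unassigned"              # hypothetical label, not from the text


print(classify_subread(["A"] * 9 + ["B"]))   # 90% of markers support A
print(classify_subread([]))
```

Using a strict majority threshold rather than unanimity tolerates the occasional sequencing error at a marker site within a long read.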

Haplotype-specific variant calling and annotation. Haplotype-specific variants 
were called following the assembly-based variation calling method. Owing to 
possibilities of false positives introduced by misassembly, phased variants that 
agreed with initial variants called with whole genome sequencing reads were used 
for further analysis. After functional annotation using GENCODE v19 (ref. 30), 
disease risk alleles were screened using ClinVar. Haplotyping of CYP2D6 was 
done by comparing haplotigs to M33388 following CYP2D6 nomenclature. 

De novo assembly of BAC clones and SV validation. BACs identified to be 
discordant in size (>1 kb) were pooled and sequenced with the SMRT platform. 
The subreads were assembled using Canu after screening and removing 
Escherichia coli or vector sequences with CrossMatch. The assembled BAC 
contigs were polished with Quiver. The BAC contigs were thereafter used to 
validate AK1 assembly-based or phase-specific SVs by assessing the concordance 
between the assembly and the BAC contig at sites of detected SVs. 

Heterozygosity and allele-specific expression. On the basis of the alignments 
of haplotigs to GRCh37, haplotig A and B were localized to compare partner 
sequences. The number of different bases was summed in every 5 Mb window 
and converted to percentiles for plotting in Fig. 3a. RNA-seq reads were trimmed and 
aligned to GRCh37 using STAR aligner with the two-pass mapping strategy as 
recommended. Duplicates were removed using Picard tools, and variants were 
called using HaplotypeCaller and VariantFiltration following GATK best practices 
on RNA-seq. Sites with supportive evidence of altered variation in RNA-seq 
were extracted from the final VCF file, and ASEReadCounter was applied to 
remove reads with low base quality. Read counts were annotated to the phase-specific 
variants called from haplotigs using in-house scripts. When read depth for one 
allele was over 30, it was considered as 'expressed'. 

Extended Data Figure 1 | Global overview of data generation and 
sequencing throughput. Flowchart of the data generation, processing, 
and analysing for the de novo assembly and haplotype phasing of the 
AK1 diploid genome. *The SMRT platform sequencing throughput 
is described in Supplementary Table 1. †The number of reads and 
sequencing throughput from the Illumina platform are 1,635,192,864 
and 249,914,122,464 bp, respectively. ‡AK1 BAC library was sequenced 
using Sanger capillary end sequencing (single end: 22,563, paired end: 
62,758), Illumina (31,719) and SMRT (307) platform. §Linked-read data 
were additionally generated with the GemCode platform to produce 
1,153,598,732 reads from high molecular mass DNA with an average insert 
size of 100 kb. 

Extended Data Figure 2 | Length distribution of SMRT subreads and 
FALCON parameter optimization for assembly. a, The y axis on the left 
shows the number of subreads with given length (bin size = 100 bp) on 
the x axis, whereas the y axis on the right shows the sum of the length of 
subreads longer than or equal to the given length on the x axis. b, Effects of 
length cutoff parameters on contig N50 in de novo assembly by FALCON 
are shown on the right. The contig N50 depends on the two parameters 
related to the amount of error-corrected reads for final assembly, length_cutoff 
and length_cutoff_pr, where the former was fixed at 
10 kb but the latter varied from 10 to 16 kb. Black and green lines indicate 
the changes of N50 for the 72× and 101× sequencing datasets, respectively. 

Extended Data Figure 3 | Graphical representation of hybrid assembly 
and statistics for next generation map and genome map. a, The hybrid 
assembly approach aligns in silico generated maps from sequence contigs 
with genome maps. When genome maps bridge two contigs, a scaffold is 
produced. The comparison is visualized between the genome maps and 
in silico map and the genome map are highlighted in red, and mismatches 

Extended Data Figure 4 | Assessment of assembly accuracy with 
homopolymer and read depth coverage generated with short reads. 
a, Distribution of corrections in homopolymer. Pilon mostly corrected 
the single base deletions in the assembly and the corrections are enriched 
in regions with long stretches of homopolymer. b, The read-depth 
distribution against AK1 assembly, AK1 assembly with scaffolds >500 kb, 
GRCh37 and GRCh38. As the mean coverage depth of short reads was 
72×, a peak is shown around it representing the fraction of autosomal 
region. Another peak is shown at ~36×, which is half of the mean 
coverage depth, representing the contigs derived from chromosomes 
X and Y. The fluctuating long tail shows 3-copy and 4-copy regions of a 
haplotype, and is more clearly observed with AK1 long scaffolds. The overall 
pattern shows that more SVs are reflected in AK1 long contigs than in 
the reference. The short contigs (<500 kb) are only 120.4 Mb, comprising 
a small fraction of the AK1 assembly. c, Density plot of the homozygously 
altered allele read depth from long scaffolds (>500 kb). Most variants are 
skewed towards low allelic read depth, suggested to be mainly due to sequencing 
artefact or mapping bias. 

Extended Data Figure 5 | An example of filled sequence that matches 
perfectly with the patch sequence (KN538365.1). a, One AK1 scaffold 
(Super-scaffold3_99) closes a 100-bp gap in chromosome 10, reducing 
the size of this gap to zero while it also removes a 4.7 kb false duplication 
found left of the gap. This information corresponds perfectly to the 
GRCh38 fix patch (KN538365.1) sequence covering this region, thus 
validating our assembly and gap closing accuracy. b, Six dot plots show 
the comparison between GRCh38, KN538365.1 and the AK1 assembly. 
The dot plots are organized in the following manner: reference-reference 
(top left), KN538365.1-reference (top middle), AK1-reference (top right), 
KN538365.1-KN538365.1 (centre middle), AK1-KN538365.1 (middle 
right) and AK1-AK1 (bottom right). 

Extended Data Figure 6 | Number of SVs and repeat composition. 
a, Overall distribution of SVs. By direct comparison between AK1 
assembly and GRCh37 reference genome, deletion (red), insertion (blue), 
inversion (green), and complex (grey) variants were detected. Outer pie 
chart represents new variants for each SV type. In total, 65% (11,927) 
of the SVs were unreported previously. b, Repeat composition of AK1 
insertion and deletion. Both insertions and deletions are mostly composed 
of mobile elements or tandem repeats. Complex is defined as the SVs 
having either several annotated repeat elements, or at least 30% of the 
remaining sequence not annotated as repeat. 
Panel a chart labels: total SVs, 18,210; insertions, 10,077 (novel, 7,710); inversions, 71 (novel, 48; known, 23); known deletions, 3,893. 

Extended Data Figure 7 | MHC class II haplotigs alignment on 
chromosome 6 and dot plots. a, MHC haplotigs A and B aligned on 
GRCh37 chr6. The complex regions shown in Fig. 3a are in green bars. 
b, Dot plot of haplotig A and B to the reference genome. The region 
highlighted in red is giving many SVs when aligning on the reference 
owing to different sequence context in haplotigs. c, Dot plots of haplotig 
A and B to the alternative loci (ALT) patches of MHC region in hg38. 
Haplotig A had the most similarities with chr6_GL000255v2_alt for the 
highlighted region in b. The blank vertical lines indicate 'N' bases in the 
reference ALT sequence. 

Extended Data Table 1 | Summary of de novo assembly and phasing statistics 


No. of contigs Sum N50 Longest Average No. of Percent bases 
Dataset or scaffolds (Gb) (Mb) (Mb) (Mb) gaps of gaps 
Contigs Assembly 3,128 28717778528 ° 0.00 
Scaffold V1 Scaffolding 2,927 289 «2041139992818 075 
Scatfolding 
Scaffold v2 Scaffolding 2,832 29 448 «1139 1.08 at 264 1.29 
Linked-reads 1,468 262 570 296 «178 143 
Phased Block nla 
apie 836 2641185 6508.16 


AC 


A 3,185 263-241 11.21 (0.83 328 
Haplotig nla 
B 15,816 219 032 252 0.14 2,012 


Phased blocks are measured by subread coverage based on markers at each step. These subreads are locally assembled to build haplotigs. A, haplotig A assembled with subreads 
phased as haplotype A and homozygous; B, haplotig B assembled with subreads phased as haplotype B; BAC, bacterial artificial chromosome sequenced with short reads; Fsr-GM, 
fragile site rescued genome map; n/a, not applicable. 

Extended Data Table 2 | BAC clone paired-end sequence placements to AK1 and human reference 


Type of Placement GRCh37 GRCh38 AK1 v1 AK1 v2 
Paired End Placements 62,076 62,091 62,038 62,022 
Unique Placements 60,585 60,460 61,152 61,132 
Concordant Placements 56,328 56,486 58,359 58,340 
Scaffolding Placements 0 0 3,027 2,928 
Discordant Placements 4,257 3,974 2,793 2,792 
Discordant in Size 1,481 1,358 1,310 1,316 
Discordant in Orientation 1,619 1,555 1,483 1,476 
Discordant in Chromosomes 1,157 1,061 n/a n/a 
Multiple Placements 1,491 1,631 886 890 
Orphan Placements 655 642 698 713 
Unmapped 27 25 22 23 
Mean in silico insert size (bp) 102,788 102,785 102,928 102,947 
In silico insert size standard deviation (bp) 23,149 23,151 23,263 23,295 
Total 62,758 62,758 62,758 62,758 


The summary of BAC paired-end read alignments to the human reference and the AK1 assembly. The end sequence placements indicate that the assembly quality of AK1 v2 is comparable 
to that of the human reference genome. 

Extended Data Table 3 | Haplotig length and reference coverage on each chromosome 


Haplotig A Haplotig B 
No. Bases Coverage N50 Longest No. Bases Coverage N50 Longest 
haplotigs (Mb) (kb) (Mb) haplotigs (Mb) (kb) (Mb) 
chr1 175 2174 96% 2.2184 78 1,262 182.1 81% 337.8 15 
chr2 179 229.9 96% 2,328.1 93 1,403 193.8 81% 3155 15 
chr3 139 1948 100% 2,282.2 a8 1,250 156.7 80% 297.8 16 
chr4 140 1746 93% 2,380.3 107 1,084 153.0 82% ati 16 
chr5 122 172.9 97% 2,296.1 13 1,064 143.2 81% 300.3 13 
chr6 104 165.7 99% 2,048.4 67 969 136.5 82% 325.0 14 
chr7 123 149.41 96% 2,294.4 18 e78 1253 81% 316.0 18 
chr8 7 144.2 101% 2,340.6 74 850 121.6 85% 316.0 34 
chr9 28 112.4 93% 2,748.8 72 565 98.6 20% 3778 26 
chr10 114 125.4 95% 2,485.2 45 770 106.1 81% 328.2 14 
chr11 100 126.1 96% 2,606.4 79 724 1068 81% 346.5 18 
chr12 73 128.5 98% 2,958.9 109 m7 107.5 82% 31241 15 
chr13 56 95.0 99% 2,975.3 74 572 79.4 83% 286.6 12 
chr14 56 861 97% 2675.1 66 457 752 85% 361.8 18 
chr15 74 765 94% 2,530.2 64 431 635 78% 304.8 10 
chr16 92 73.0 93% 2,196.8 73 433 e7 78% 305.7 12 
chr17 70 748 96% 1,367.1 84 489 618 79% 269.0 13 
chr18 4 736 99% 3,309.4 cif 414 64.1 86% 296.5 13 
chr19 oy) 55.0 99% 1,853.4 44 316 48.2 86% 291.6 16 
chr20 a” 579 97% 3,914.5 76 328 49.8 84% 349.4 18 
chr21 36 33.0 94% 1,980.9 125 180 29.4 83% 304.1 os 
chr22 30 33.4 95% 2,429.5 62 199 298 85% 366.0 14 
Total 4,992 2,598.1 97% 2,403.9 125 15,315 2,191.5 82% 3187 34 


Cov 


lboth assembly and phasing achieved higher contiguity. 

Genome-wide associations for birth weight and 

Birth weight (BW) has been shown to be influenced by both fetal 
and maternal factors and in observational studies is reproducibly 
associated with future risk of adult metabolic diseases including 
type 2 diabetes (T2D) and cardiovascular disease. These life-course 
associations have often been attributed to the impact of an 
adverse early life environment. Here, we performed a multi-ancestry 
genome-wide association study (GWAS) meta-analysis of BW in 
153,781 individuals, identifying 60 loci where fetal genotype was 
associated with BW (P < 5 × 10⁻⁸). Overall, approximately 15% 
of variance in BW was captured by assays of fetal genetic variation. 
Using genetic association alone, we found strong inverse genetic 
correlations between BW and systolic blood pressure (Rg = −0.22, 
P = 5.5 × 10-"), T2D (Rg = −0.27, P = 1.1 × 10⁻⁶) and coronary 
artery disease (Rg = −0.30, P = 6.5 × 10~*). In addition, using 
large-cohort datasets, we demonstrated that genetic factors were the 
major contributor to the negative covariance between BW and future 
cardiometabolic risk. Pathway analyses indicated that the protein 
products of genes within BW-associated regions were enriched for 
diverse processes including insulin signalling, glucose homeostasis, 
glycogen biosynthesis and chromatin remodelling. There was also 
enrichment of associations with BW in known imprinted regions 
(P = 1.9 × 10⁻⁴). We demonstrate that life-course associations 
between early growth phenotypes and adult cardiometabolic disease 
are in part the result of shared genetic effects and identify some of the 
pathways through which these causal genetic effects are mediated. 

We combined GWAS data for BW from 153,781 individuals representing 
multiple ancestries from 37 studies across three components 
(Extended Data Fig. 1 and Supplementary Table 1): (i) 75,891 
individuals of European ancestry from 30 studies; (ii) 67,786 individuals 
of European ancestry from the UK Biobank; and (iii) 10,104 individuals 
of diverse ancestries (African American, Chinese, Filipino, Surinamese, 
Turkish and Moroccan) from six studies. Within each study, BW was 
Z-score transformed separately in males and females after excluding 
non-singletons and premature births and adjusting for gestational age 
where available. Genotypes were imputed using reference panels from 
the 1000 Genomes (1000G) Project or combined 1000G and UK10K 
projects (Supplementary Table 2). We performed quality control 
assessments to confirm that the distribution of BW was consistent 
across studies, irrespective of the data collection protocol, and 
confirmed that self-reported BW in the UK Biobank showed genetic 
and phenotypic associations consistent with those seen for measured 
BW in other studies (Methods). 
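
The sex-specific Z-score transformation can be illustrated with a minimal sketch. It covers only the standardization step; the exclusions and the gestational age adjustment described above are not modelled, and the record layout and example weights are hypothetical.

```python
from statistics import mean, stdev


def zscore_by_sex(records):
    """Standardize birth weight within each sex.

    records is a list of (sex, birth_weight_g) tuples; the result is a list
    of Z-scores in the same order. Each sex needs at least two records.
    """
    by_sex = {}
    for sex, weight in records:
        by_sex.setdefault(sex, []).append(weight)
    # Mean and sample standard deviation computed separately per sex.
    stats = {sex: (mean(w), stdev(w)) for sex, w in by_sex.items()}
    return [(weight - stats[sex][0]) / stats[sex][1] for sex, weight in records]


# Hypothetical birth weights in grams.
toy = [("M", 3000), ("M", 3400), ("M", 3800),
       ("F", 2900), ("F", 3300), ("F", 3700)]
print(zscore_by_sex(toy))
```

Standardizing within sex removes the mean difference in BW between males and females, so effect sizes from different studies are expressed on a common standard-deviation scale.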

We identified 60 loci (of which 59 were autosomal) associated with 
BW at genome-wide significance (P < 5 × 10⁻⁸) in either the European 
ancestry or trans-ancestry meta-analyses (Extended Data Fig. 2a, 
Extended Data Table 1a and Supplementary Data; Methods). For lead 
single nucleotide polymorphisms (SNPs), we observed no heterogeneity 
in allelic effects between the three study components (Cochran's 
Q statistic P > 0.00083) (Supplementary Table 3). We found that 53 of 
these loci were novel in that the lead SNP mapped >2 Mb away from, 
and was independent (R² < 0.05 in the European (EUR) component 
of 1000G) of, the seven previously reported BW signals, all of 
which were confirmed in this larger analysis (Supplementary Table 
4). Approximate conditional analysis in the European ancestry data 
indicated that three of these novel loci (near ZBTB7B, HMGA1 and 
PTCH1) harboured multiple distinct association signals that attained 
genome-wide significance (P < 5 × 10⁻⁸) (Methods, Supplementary 
Table 5 and Extended Data Fig. 3). 

A list of affiliations appears in the online version of this paper. 

The lead variants for most signals mapped to non-coding sequences, 
and at only two loci, ADRB1 (rs7076938; R² = 0.99 with ADRB1 G389R) 
and NRIP1 (rs2229742, R448G), did the association data point to 
potential causal non-synonymous coding variants (Supplementary 
Table 6 and Methods). Lead SNPs for all but two loci (those mapping 
near YKT6-GCK and SUZ12P1-CRLF3) were common (minor 
allele frequency (MAF) > 5%) with individually modest effects on 
BW (β = 0.020-0.053 standard deviations (s.d.) per allele, equivalent 
to 10-26 g). This was despite the much-improved coverage of low-frequency 
variants in this study (compared to previous HapMap 2 
imputed meta-analyses, ref. 5) reflecting imputation from larger, 
and more complete, reference panels (Extended Data Table 1b). 
Indeed, all but five of the common variant association signals were 
tagged by variants (EUR R² > 0.6) in the HapMap 2 reference panel 
(Supplementary Tables 4, 5), indicating that most of the novel discoveries 
in the present study were driven by increased sample size. Fine-mapping 
analysis yielded 14 regions in which fewer than ten variants 
contributed to the locus-specific credible sets that accounted for >99% 
of the posterior probability of association (Methods and Supplementary 
Table 7). The greatest refinement was at YKT6-GCK, where the 
credible set included only the low frequency variant rs138715366, 
which maps intronic to YKT6. These credible-set variants collectively 
showed enrichment for overlap with DNase I hypersensitivity sites, 
particularly those generated, by ENCODE, from fetal (4.2-fold, 95% 
CI 1.8-10.7) and neonatal tissues (4.9-fold, 1.8-11.0) (Supplementary 
Fig. 1, Supplementary Table 8 and Methods). 

In combination, the 62 distinct genome-wide significant signals 
at the 59 autosomal loci explained at least 2.0 ± 1.1% (standard error 
(s.e.)) of variance in BW (Supplementary Table 9 and Methods), which 
is similar in magnitude to that attributable to sex or maternal body mass 
index (BMI). However, the variance in BW captured collectively by 
all autosomal genotyped variants on the array was considerably larger, 
estimated at 15.1 ± 0.9% in the UK Biobank (Methods). These figures 
are consistent with a large number of genetic variants with smaller 
effects contributing to variation in BW. 

Associations between fetal genotype and BW could result from 
indirect effects of the maternal genotype influencing BW via the 
intrauterine environment, given the correlation (R ~ 0.5) between 
maternal and fetal genotype. However, two lines of evidence indicated 
that variation in the fetal genome was the predominant driver of 
BW associations. First, an analysis of the global contribution of 
maternal versus fetal genetic variation, using a maternal genome-wide 
complex trait analysis (GCTA) model (ref. 6) (Methods) applied 
to 4,382 mother-child pairs, estimated that the child's genotype 
(σC² = 0.24 ± 0.11) made a larger contribution to BW variance than 
either the mother's genotype (σM² = 0.04 ± 0.10), or the covariance 
between the two (σCM = 0.04 ± 0.08). Second, when we compared 
the point estimates of the BW-effect size dependent on maternal 
genotype at each of the 60 loci (as measured in up to 68,254 women) 
with those dependent on fetal genotype (using European ancestry data 
from 143,677 individuals in the present study), fetal variation had a 
greater impact than maternal variation at 93% of the loci (55 out of 60; 
binomial P = 10⁻¹¹) (Supplementary Table 10, Extended Data Figs 4, 5 
and Methods). The power to further disentangle maternal and fetal 
contributions using analyses of fetal genotype which were conditional 
on maternal genotype was constrained by the limited sample size 
available (n= 12,909 mother-child pairs) (Supplementary Table 11). 
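
The binomial comparison quoted above (fetal effect larger than maternal effect at 55 of 60 loci) can be checked with an exact sign test. This is a sketch under the assumption of a two-sided test with a null probability of 0.5; the text does not state which form of the test was used, and the function name is illustrative.

```python
from math import comb


def two_sided_sign_test(successes, trials):
    """Exact two-sided binomial P value against a null probability of 0.5."""
    k = max(successes, trials - successes)
    # Upper-tail probability of observing k or more successes.
    tail = sum(comb(trials, i) for i in range(k, trials + 1)) / 2 ** trials
    return min(1.0, 2 * tail)


p_value = two_sided_sign_test(55, 60)
print(f"{p_value:.2e}")
```

Under these assumptions the result is on the order of 1 × 10⁻¹¹, in line with the magnitude reported in the text.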

Collectively, these analyses provide evidence that the fetal genotype 
has a substantial impact on early growth, as measured by BW. We 
used these genetic associations to understand the causal relationships 
underlying observed associations between BW and disease, and to 
characterize the processes responsible. 

To quantify the shared genetic contribution to BW and other
health-related traits, we estimated their genetic correlations
using linkage-disequilibrium score regression (ref. 8) (Methods). BW
(in European ancestry samples) showed strong positive genetic 
correlations with anthropometric and obesity-related traits 
including birth length (Rg = 0.81, P = 2.0 × 10~*) and, in adults,
height (Rg = 0.41, P = 4.8 × 10°™), waist circumference (Rg = 0.18,
P = 3.9 × 10⁻¹⁰) and BMI (Rg = 0.11, P = 7.3 × 10~°). By contrast,
BW showed inverse genetic correlations with indicators of adverse
metabolic and cardiovascular health including coronary artery disease
(CAD, Rg = −0.30, P = 6.5 × 10~*), systolic blood pressure (SBP, Rg =
−0.22, P = 5.5 × 10-9) and T2D (Rg = −0.27, P = 1.1 × 10~®) (Fig. 1,
Supplementary Table 12). The correlations between BW and adult car- 
diometabolic phenotypes are of similar magnitude, although direction- 
ally opposite, to the reported genetic correlations between adult BMI 
and those same cardiometabolic outcomes*. These findings support 
observational associations between a history of paternal T2D and lower 
BW (ref. 4), and establish more generally that the observed life-course 
associations between early growth and adult disease, at least in part, 
reflect the impact of shared genetic variants that influence both sets 
of phenotypes. 

In an effort to estimate the extent of genetic contribution to these 
life-course associations, we first focused on data from the UK Biobank 
(n = 57,715). For many of the traits for which data were available,
genetic variation contributed substantially to the life-course relation- 
ship between BW and adult phenotypes, and in some cases appeared 
to be the major source of covariance between the traits. For example, 
we estimated that 85% (95% CI = 70-99%) of the negative covariance 
between BW and SBP was explained by shared genetic associations cap- 
tured by directly genotyped SNPs (Supplementary Table 13, Methods 
and Supplementary Fig. 2). For continuous cardiometabolic measures, 
including lipids and fasting glycaemia, for which measures are not cur- 
rently available in the UK Biobank, we used data from the Northern 
Finland Birth Cohort (n= 5,009), and obtained similar results 
(Supplementary Table 13). However, these estimates were limited, not 
only by wide confidence intervals, but also by the assumption of a lin- 
ear relationship between BW and each of the phenotypes and by the 
inability to explicitly model maternal genotypic effects. In other words, 
the inverse genetic correlations between BW and cardiometabolic traits 
may not exclusively reflect genetic effects mediated directly through 
the offspring, but also effects mediated by maternal genotype acting 
indirectly on the fetus via perturbation of the in utero environment. 
Nevertheless, these estimates indicate that a substantial proportion of 
the variance in cardiometabolic risk that correlates with BW can be 
attributed to the effects of common genetic variation. 
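
The proportion of a phenotypic covariance attributable to shared genetics can be illustrated with a standard identity for standardized traits: the genetic covariance is the genetic correlation multiplied by the square root of the product of the two heritabilities. The sketch below is an assumption-laden illustration of that identity, not the estimation procedure used in the study (which is described in the Methods and Supplementary Information); all input values are hypothetical.

```python
from math import sqrt


def genetic_share_of_covariance(r_g: float, h2_a: float, h2_b: float, r_p: float) -> float:
    """Fraction of the phenotypic correlation r_p explained by shared genetics.

    Assumes both traits are standardized (variance 1), so covariances equal
    correlations and the genetic covariance is r_g * sqrt(h2_a * h2_b).
    """
    if r_p == 0:
        raise ValueError("phenotypic correlation must be non-zero")
    genetic_covariance = r_g * sqrt(h2_a * h2_b)
    return genetic_covariance / r_p


# Hypothetical inputs chosen only to illustrate the arithmetic.
share = genetic_share_of_covariance(r_g=-0.22, h2_a=0.15, h2_b=0.20, r_p=-0.05)
print(f"{share:.2f}")
```

A share close to 1 would indicate that most of the observed covariance is captured by the genotyped variants; the wide confidence intervals noted in the text apply to any such ratio.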

To elucidate the biological pathways and processes underlying regu-
lation of fetal growth, we first performed gene set enrichment analysis
of our BW GWAS results using the MAGENTA (Meta-Analysis Gene-
set Enrichment of variaNT Associations; ref. 9) approach (Methods).
Twelve pathways reached study-wide significance (false discovery rate,
FDR < 0.05), including pathways involved in metabolism (insulin
signalling, glycogen biosynthesis and cholesterol biosynthesis), 
growth (IGF signalling and growth hormone pathway) and devel- 
opment (chromatin remodelling) (Extended Data Table 2a). Similar 
pathways were detected in a complementary analysis in which we 
analysed empirical protein-protein interaction (PPI) data identifying 
13 PPI network modules with marked (Z score > 5) enrichment for
BW-association scores (Extended Data Table 2b, Extended Data 
Fig. 6a, b and Methods). The proteins within these modules were them- 
selves enriched for diverse processes related to metabolism, growth and 
development (Extended Data Fig. 6a, b).

We also observed enrichment of BW association signals across the 
set of 77 imprinted genes defined by the Genotype-Tissue Expression 
(GTEx) project (ref. 10) (P = 1.9 × 10⁻⁴; Extended Data Table 2a and
Supplementary Table 14). Such enrichment is consistent with the
‘parental conflict’ hypothesis regarding the allocation of maternal
resources to the fetus (ref. 11). Although the role of imprinted genes in
fetal growth has been described in animal models and rare human
disorders, these data provide a large-scale, systematic indication
of their contribution to normal variation in BW. Of the 60 genome- 
wide significant loci, two (INS-IGF2 and RB1) fall within (or near) 
imprinted regions (Extended Data Fig. 2b), with a noteworthy third 
signal at DLK1 (previously fetal antigen-1; P=5.6 x 10~*). Parent- 
of-origin specific analyses to further investigate these individual loci 
(comparing heterozygote versus homozygote BW variance in 57,715 
unrelated individuals, and testing BW associations with paternal versus 
maternal alleles in 4,908 mother-child pairs; see Methods) proved, 
despite these sample sizes, to be underpowered (Extended Data Fig. 7 
and Supplementary Tables 15, 16). 

Many of the genome-wide signals for BW detected here are also 
established genome-wide association signals for a wide variety of 
cardiometabolic traits (Fig. 2). These include the BW signals near 
CDKAL1, ADCY5, HHEX-IDE and ANK1 (also genome-wide
significant for T2D), NT5C2 (for blood pressure, CAD and BMI) and
ADRB1 (for blood pressure). We used two approaches to understand
whether this pattern of adult trait association represented a generic 
property of BW-associated loci or reflected heterogeneous mechanisms 
linking BW to adult disease. 

First, we applied unsupervised hierarchical clustering (Methods) to 
the non-BW trait association statistics for the 60 significant BW loci.
The resultant heat map showed the heterogeneity of locus-specific effect 
sizes across the range of adult traits (Fig. 2 and Supplementary Table 17). 
For example, it revealed that the associations between BW-raising 
alleles and increased adult height are concentrated amongst a subset 
of loci including HHIP and GNA12, and highlighted particularly strong
associations with lipid traits for variants at the TRIB1 and MAFB loci.

Figure 1 | Genome-wide genetic correlation between BW and
a range of traits and diseases in later life. Genetic correlation
(Rg) and corresponding s.e. (error bars) between BW and the
traits displayed on the x axis were estimated using linkage
disequilibrium score regression (ref. 8). The genetic correlation
estimates (Rg) are colour coded according to their intensity and
direction (red for positive and blue for inverse correlation).
WHRadjBMI, waist-hip ratio adjusted for body mass index;
HOMA-B/IR, homeostasis model assessment of beta-cell
function/insulin resistance; HbA1c, haemoglobin A1c; BMD, bone
mineral density; ADHD, attention deficit hyperactivity disorder.
See Supplementary Table 12 for references for each of the traits
and diseases displayed.

Second, we constructed trait-specific ‘point-of-contact’ (PoC) PPI
networks from proteins represented in both the global BW PPI network 
and equivalent PPI networks generated for each of the adult traits 
(Methods and Extended Data Figs 6c-e). We reasoned that these PoC 
PPI networks would be enriched for the specific proteins mediating the 
observed links between BW and adult traits, generating hypotheses that 
are amenable to subsequent empirical validation. To highlight processes 
implicated in specific BW-trait associations, we overlaid these PoC PPI 
with the top 50 pathways that were over-represented in the global BW 
PPI network. These analyses revealed, for example, that proteins in 
the Wnt canonical signalling pathway were detected in the PoC PPI 
network only for blood pressure traits. We used these PPI overlaps to 
highlight the specific transcripts within BW GWAS loci that were likely 
to mediate the mechanistic links. For example, the overlap between the
Wnt signalling pathway and the PoC PPI network for the intersection
of BW and blood pressure-related traits implicated FZD9 as the likely 
effector gene at the MLXIPL BW locus (Extended Data Fig. 6d and 
Supplementary Table 6). 

We focused our more detailed investigation of the mechanistic links 
between early growth and adult traits on two phenotypic areas: arterial 
blood pressure and T2D/glycaemia. Across both the overall GWAS and 
specifically among the 60 significant BW loci, most BW-raising alleles 
were associated with reduced blood pressure (Figs 1, 2); the strongest
inverse associations were seen for the loci near NT5C2, FES, NRIP1,
EBF1 and PTH1R. However, we also observed locus-specific hetero-
geneity in the genetic relationships between blood pressure and BW:
the SBP-raising allele at ADRB1 is associated with higher, rather than
lower, BW (Extended Data Fig. 8a). When we considered the reciprocal 
relationship, that is, the effects on BW of blood-pressure-raising alleles 
at 30 reported loci for SBP, there was an excess of associations
(5 out of 30 with lower BW at P < 0.05; binomial P = 0.0026; Extended
Data Fig. 8a). To dissect maternal and fetal genotype effects at these
loci, we tested the impact on BW of a risk score generated from the 30
SBP SNPs, restricted to the untransmitted maternal haplotype score,
in a set of 5,201 mother-child pairs. Analysis of these loci indicated that
maternal genotype effects on the intrauterine environment probably
contribute to the inverse genetic correlation between SBP and BW
(Methods and Supplementary Table 18), and was consistent with
the results of a wider study of >30,000 women which demonstrated
associations between a maternal genetic score for SBP (conditional on
fetal genotype) and lower offspring BW.

Figure 2 | Hierarchical clustering of BW loci based on similarity of
overlap with adult diseases, metabolic and anthropometric traits. For
the lead SNP at each BW locus (x axis), Z scores (aligned to BW-raising
allele) were obtained from publicly available GWAS for various traits
(y axis; see Supplementary Table 17). A positive Z score (red) indicates a
positive association between the BW-raising allele and the outcome trait,

The blood-pressure-raising allele with the largest BW-lowering 
effect mapped to the NT5C2 locus (index variant for BW, rs74233809,
R² = 0.98 with index variant for blood pressure, rs11191548; ref. 14) and
was also associated with lower adult BMI (R² = 0.99 with rs11191560;
ref. 17). The BW-lowering allele at rs74233809 is a proxy for a recently
described functional variant in the nearby CYP17A1 gene (R² = 0.92
with rs138009835). The CYP17A1 gene encodes the cytochrome
P450c17α enzyme CYP17 (ref. 19), which catalyses key steps in steroi-
dogenesis that determine the balance between mineralocorticoid,
glucocorticoid and androgen synthesis. This variant has been shown to
alter transcriptional efficiency in vitro and is associated with increased
urinary tetrahydroaldosterone excretion. CYP17A1 is expressed in
fetal adrenal glands and testes from early gestation as well as in the
placenta. These data suggest that variation in CYP17A1 expression
contributes to the observational association between low BW and adult
hypertension.

When we analysed 45 loci associated with CAD, the inverse genetic
correlation between CAD and BW was concentrated amongst the five 
CAD loci with primary blood pressure associations. This suggests that 
genetic determinants of blood pressure play a leading role in mediating 
the life-course associations between BW and CAD (Extended Data 
Fig. 8b, e). 

Linkage-disequilibrium score regression analyses demonstrated 
overall inverse genetic correlation between lower BW and elevated risk 
of T2D (Fig. 1). However, the locus-specific heat map indicates a heter- 
ogeneous pattern across individual loci (Fig. 2). To explore this further, 
we tested the 84 reported T2D loci for association with BW. Some T2D
risk alleles (such as those at ADCY5, CDKAL1 and HHEX-IDE) were
strongly associated with lower BW, while others (including ANK1 and
MTNR1B) were associated with higher BW (Extended Data Fig. 8c).
This was in contrast with the BW effects of 422 known height loci
(Extended Data Fig. 8d), which showed a strong positive correlation 
consistent with the overall genetic correlation between height and BW, 
indicating that the growth effects of many height loci start prenatally 
and persist into adulthood. 

The contrasting associations of T2D-risk alleles with both higher
and lower BW probably reflect the differential impacts, across loci, 
of variation in the maternal and fetal genomes. Observational data 
link paternal diabetes with lower offspring BW (ref. 4), indicating that the
inheritance of T2D risk alleles by the fetus tends, in line with the 
linkage-disequilibrium score regression analysis, to reduce growth. 
These relationships are consistent with the precepts of the ‘fetal insulin
hypothesis’ and reflect the potential for reduced insulin secretion and/
or signalling to lead to both reduced fetal growth and, many decades 
later, enhanced predisposition to T2D. In line with this, the inferred 
paternal transmitted haplotype score generated from the 84 T2D risk 
variants was associated with lower BW (P=0.045) in 5,201 mother- 
child pairs (Methods and Supplementary Table 18). In contrast, 
maternal diabetes is observationally associated with higher offspring 
BW, reflecting the ability of maternal hyperglycaemia to stimulate fetal
insulin secretion. The contribution of genotype-dependent maternal 
hyperglycaemia to BW is in line with the evidence, from a recent study, 
that maternal genotype scores for fasting glucose and T2D (conditional 
on fetal genotype) were causally associated with higher offspring BW.
It is also consistent with the observation that a subset of glucose-raising
alleles is associated with higher BW. For example, the T2D-risk variant
at MTNR1B (which also has a marked effect on fasting glucose levels
in non-diabetic individuals) was amongst the subset of BW loci
(5 out of 60) for which the BW effect attributable to maternal genotype 
exceeded that associated with the fetal genotype (maternal: β = 0.048,
P = 5.1 × 10-}; fetal: β = 0.023, P = 2.9 × 10-*) (Supplementary
Table 10 and Extended Data Figs 4, 5). Thus, both maternal and fetal 
genetic effects connect BW to later T2D risk, albeit acting in opposing 
directions. When we categorized T2D loci using a classification of
physiological functions derived from their effects on related glycaemic 
and anthropometric traits, we found that T2D-risk alleles associated
with lower BW were those typically characterized by reduced insulin 
processing and secretion without detectable changes in fasting glucose 
(the ‘Beta Cell’ cluster in Extended Data Fig. 8f). 

The YKT6 signal at rs138715366 is notable not only because the
genetic data indicate that a single low-frequency non-coding variant 
is driving the association signal (see above) but also because of the 
proximity of this signal to GCK. Rare coding variants in glucoki- 
nase are causal for a form of monogenic hyperglycaemia and lead 
to large reductions in BW when parental alleles are passed on to 
their offspring. In addition, common non-coding variants nearby
are implicated in T2D risk and fasting hyperglycaemia. However,
the latter variants are conditionally independent of rs138715366
(Supplementary Table 19) and show no comparable association with
lower BW. Either rs138715366 acts through effector transcripts other
than GCK, or the impact of the low-frequency SNP near YKT6 on 
GCK expression involves tissue- and/or temporal-specific variation in 
regulatory impact. 

In conclusion, we have identified 60 genetic loci associated with BW 
and used them to gain insights into the aetiology of fetal growth and 
into well-established, but until now poorly understood, life-course 
disease associations. The evidence that the relationship between early 
growth and later metabolic disease has an appreciable genetic com- 
ponent contrasts with, but is not necessarily incompatible with, the 
emphasis on adverse early environmental events highlighted by the 
fetal origins hypothesis. As we have shown, these genetic effects reflect
variation in both the fetal and the maternal genome: the impact of 
the latter on the offspring’s predisposition to adult disease could be 
mediated, at least in part, through perturbation of the antenatal and 
early life environment. Future mechanistic and genetic studies should 
support reconciliation between these alternative, but complementary, 
explanations for the far-reaching life-course associations that exist 
between events in early life and predisposition to cardiometabolic 
disease several decades later. 


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.
METHODS

Ethics statement. All human research was approved by the relevant institutional 
review boards and conducted according to the Declaration of Helsinki. All par-
ticipants provided written informed consent. Ethical approval for the study was
obtained from the ALSPAC Ethics and Law Committee and the local Research
Ethics Committees.

Study-level analyses. No statistical methods were used to predetermine sample 
size: to maximise power to detect association signals, we set out to collect the largest 
possible set of samples for which the combination of genome-wide genotyping 
data and reliable measures of BW could be made available for analysis. Within 
each study, BW was collected from a variety of sources, including measurements 
at birth by medical practitioners, obstetric records, medical registers, interviews 
with the mother and self-report as adults (Supplementary Table 1). BW was 
Z-score transformed separately in males and females. Individuals with extreme BW
(>5 s.d. from the sex-specific study mean), monozygotic or polyzygotic siblings, or
preterm births (gestational age <37 weeks, where this information was available)
were excluded from downstream association analyses (Supplementary Table 1).
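
The phenotype preparation described above can be sketched as follows. This is an illustration only: the field names (`sex`, `bw_g`, `gest_weeks`, `multiple_birth`) are hypothetical, and the order of operations (exclusions for multiple and preterm births first, then a sex-specific Z score, then removal of extreme values without re-standardizing) is an assumption that the text does not spell out.

```python
from statistics import mean, stdev


def prepare_birth_weights(records, max_abs_z=5.0, min_gestation_weeks=37):
    """Return eligible records with a sex-specific BW Z score added as 'bw_z'.

    Each record is a dict with 'sex' and 'bw_g', plus optional 'gest_weeks'
    and 'multiple_birth' (hypothetical field names).
    """
    # Drop multiple births and known preterm births; records with unknown
    # gestational age are kept, as the filter applies only "where available".
    eligible = [
        r for r in records
        if not r.get("multiple_birth", False)
        and (r.get("gest_weeks") is None or r["gest_weeks"] >= min_gestation_weeks)
    ]
    kept = []
    for sex in sorted({r["sex"] for r in eligible}):
        group = [r for r in eligible if r["sex"] == sex]
        weights = [r["bw_g"] for r in group]
        mu, sd = mean(weights), stdev(weights)  # needs >= 2 distinct values per sex
        for r in group:
            z = (r["bw_g"] - mu) / sd
            if abs(z) <= max_abs_z:  # exclude extreme values
                kept.append({**r, "bw_z": z})
    return kept


sample = [
    {"sex": "M", "bw_g": 3000}, {"sex": "M", "bw_g": 3500}, {"sex": "M", "bw_g": 4000},
    {"sex": "F", "bw_g": 3200}, {"sex": "F", "bw_g": 3400},
    {"sex": "F", "bw_g": 2000, "gest_weeks": 34},  # preterm, excluded
]
prepared = prepare_birth_weights(sample)
```

In a real cohort the threshold of 5 s.d. removes only a handful of records; the `max_abs_z` parameter is exposed so that the behaviour can be checked on small examples.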

Within each study, stringent quality control of the GWAS genotype scaffold was 
carried out before imputation (Supplementary Table 2). Each scaffold was then 
pre-phased and imputed up to reference panels from the 1000G project or the
combined 1000G and UK10K projects (Supplementary Table 2). Association of
BW with each variant passing established GWAS quality control filters was tested
in a linear regression framework, under an additive model for the allelic effect, after
adjustment for study-specific covariates, including gestational age, where available 
(Supplementary Table 2). Where necessary, population structure was accounted for 
by adjustment for axes of genetic variation from principal components analysis 
and subsequent genomic control correction, or inclusion of a genetic relationship 
matrix in a mixed model (Supplementary Table 2). We calculated the genomic
control inflation factor (λ) in each study to confirm that study-level population
structure was accounted for before meta-analysis. 
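
The genomic control inflation factor mentioned above is conventionally computed as the median observed association chi-square statistic divided by the median of a chi-square distribution with one degree of freedom (approximately 0.4549). The sketch below follows that convention; the function names are illustrative, and leaving standard errors unchanged when λ < 1 is a common convention assumed here rather than something stated in the text.

```python
from math import sqrt
from statistics import median

CHI2_1DF_MEDIAN = 0.4549364  # median of a chi-square distribution with 1 d.f.


def genomic_control_lambda(z_scores):
    """Median observed chi-square (Z squared) over its expected median under the null."""
    chi2 = [z * z for z in z_scores]
    return median(chi2) / CHI2_1DF_MEDIAN


def gc_correct_standard_errors(std_errors, lam):
    """Inflate standard errors by sqrt(lambda); leave them unchanged if lambda < 1."""
    factor = sqrt(max(lam, 1.0))
    return [se * factor for se in std_errors]


# Toy input whose median chi-square equals the null median, so lambda is ~1.
lam = genomic_control_lambda([0.5, -0.6744897501960817, 1.2])
corrected = gc_correct_standard_errors([0.01], 1.21)
```

A value of λ close to 1 indicates little residual inflation from population structure before the study enters the meta-analysis.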

Preparation, quality control and genetic analysis in UK Biobank samples. UK 
Biobank phenotype data were available for 502,655 participants. All participants
in the UK Biobank were asked to recall their BW, of whom 279,971 did so at either
the baseline or follow-up assessment visit. Of these, 7,686 participants reported 
being part of multiple births and were excluded from downstream analyses. 
Ancestry checks, based on self-reported ancestry, resulted in the exclusion of 8,998 
additional participants reported not to be white European. Of those individuals 
reporting BW at baseline and follow-up assessments, 393 were excluded because 
the two reported values differed by more than 0.5 kg. For those reporting different
values (<0.5 kg) between baseline and follow-up, we took the baseline measure
forward for downstream analyses. We then excluded 36,716 individuals reporting
values <2.5 kg or >4.5 kg as implausible for live term births before 1970. In total
226,178 participants had data relating to BW that matched these inclusion criteria.
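
The exclusion cascade above can be written as a per-participant rule, and the reported counts can be checked for internal consistency. In the sketch below the function and argument names are hypothetical; the handling of a difference of exactly 0.5 kg (kept) is an assumption, since the text only specifies "more than 0.5 kg".

```python
def select_birth_weight(baseline_kg, followup_kg, multiple_birth, white_european,
                        max_diff_kg=0.5, lower_kg=2.5, upper_kg=4.5):
    """Return the self-reported BW (kg) to carry forward, or None if excluded."""
    if multiple_birth or not white_european:
        return None
    if baseline_kg is None and followup_kg is None:
        return None  # BW never reported
    if baseline_kg is not None and followup_kg is not None:
        if abs(baseline_kg - followup_kg) > max_diff_kg:
            return None  # discordant reports
    # Prefer the baseline report when both are available.
    value = baseline_kg if baseline_kg is not None else followup_kg
    if value < lower_kg or value > upper_kg:
        return None  # implausible for a live term birth
    return value


# Consistency check of the counts quoted in the text.
reported = 279_971
exclusions = {
    "multiple births": 7_686,
    "not white European": 8_998,
    "discordant reports": 393,
    "outside 2.5-4.5 kg": 36_716,
}
remaining = reported - sum(exclusions.values())
print(remaining)
```

The remainder equals the 226,178 participants reported as meeting the inclusion criteria.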

Genotype data from the May 2015 release were available for a subset of 152,249 
participants from UK Biobank. In addition to the quality control metrics performed 
centrally by UK Biobank, we defined a subset of ‘white European’ ancestry samples
using a K-means (K=4) clustering approach based on the first four genetically 
determined principal components. A maximum of 67,786 individuals (40,425 
females and 27,361 males) with genotype and valid BW measures were available 
for downstream analyses. We tested for association with BW, assuming an additive 
allelic effect, in a linear mixed model implemented in BOLT-LMM (ref. 37) to
account for cryptic population structure and relatedness. Genotyping array was
included as a binary covariate in all models. Total chip heritability (that is, the
variance explained by all autosomal polymorphic genotyped SNPs passing quality
control) was calculated using restricted maximum likelihood (REML) imple-
mented in BOLT-LMM (ref. 37). We additionally analysed the association between
BW and directly genotyped SNPs on the X chromosome: for this analysis, we used
57,715 unrelated individuals with BW available and identified by UK Biobank as
‘white British’. We excluded SNPs with evidence of deviation from Hardy-Weinberg
equilibrium (P < 1 × 10⁻⁶), MAF < 0.01 or overall missing rate >0.015, resulting
in 19,423 SNPs for analysis in Plink v1.07 (http://pngu.mgh.harvard.edu/purcell/
plink/), with the first five ancestry principal components as covariates.
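
The three SNP filters above (Hardy-Weinberg deviation, minor allele frequency and missingness) can be sketched as below. This is a simplified illustration, not the Plink implementation: it uses a one-degree-of-freedom chi-square test on diploid genotype counts, whereas dedicated software typically offers an exact test, and for X-chromosome SNPs such a test would normally be restricted to females. The Hardy-Weinberg threshold is passed in explicitly rather than assumed.

```python
from math import erfc, sqrt


def hwe_chi2_p(n_aa: int, n_ab: int, n_bb: int) -> float:
    """P value of a 1-d.f. chi-square test for Hardy-Weinberg equilibrium."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    return erfc(sqrt(chi2 / 2))  # survival function of chi-square with 1 d.f.


def passes_qc(n_aa, n_ab, n_bb, n_missing, hwe_p_min, maf_min=0.01, missing_max=0.015):
    """Apply the HWE, MAF and missing-rate filters to one SNP."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        return False
    p = (2 * n_aa + n_ab) / (2 * n)
    maf = min(p, 1.0 - p)
    missing_rate = n_missing / (n + n_missing)
    return (hwe_chi2_p(n_aa, n_ab, n_bb) >= hwe_p_min
            and maf >= maf_min
            and missing_rate <= missing_max)
```

For example, `passes_qc(25, 50, 25, 0, hwe_p_min=1e-6)` accepts a SNP in perfect Hardy-Weinberg proportions, while a SNP with no heterozygotes at all, such as counts `(50, 0, 50)`, is rejected.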

In both the full UK Biobank sample and our refined sample, we observed that 
BW was associated with sex, year of birth and maternal smoking (P< 0.0015, all in 
the expected directions), confirming more comprehensive previous validation of 
self-reported BW. We additionally verified that BW associations with lead SNPs
at seven established loci based on self-report in UK Biobank were consistent with
those previously published.

European ancestry meta-analysis. The European ancestry meta-analysis consisted
of two components: (i) 75,891 individuals from 30 GWAS from Europe, USA
and Australia; and (ii) 67,786 individuals of white European origin from the
UK Biobank. In the first component, we combined sex-specific BW association
summary statistics across studies in a fixed-effects meta-analysis, implemented in
GWAMA (ref. 39), and applied a second round of genomic control (λGC = 1.001).
Subsequently, we combined association summary statistics from this component
with the UK Biobank in a European ancestry fixed-effects meta-analysis, imple-
mented in GWAMA (ref. 39). Variants failing GWAS quality control filters in
the UK Biobank, reported in less than 50% of the total sample size in the first
component, or with MAF < 0.1%, were excluded from the European ancestry
meta-analysis. We aggregated X-chromosome association summary statistics from
the UK Biobank (19,423 SNPs) with corresponding statistics from the European
GWAS component using fixed effects P-value-based meta-analysis in METAL
(ref. 40) (max n = 99,152).
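
Two meta-analysis schemes are referred to above: an effect-size-based fixed-effects analysis and a P-value-based analysis. The sketch below shows the textbook forms of each (inverse-variance weighting, and Z scores weighted by the square root of sample size); it is an illustration under those standard definitions, not a reproduction of the GWAMA or METAL code, and the function names are hypothetical.

```python
from math import erfc, sqrt


def fixed_effects_meta(betas, std_errors):
    """Inverse-variance-weighted fixed-effects estimate, its s.e. and two-sided P."""
    weights = [1.0 / se ** 2 for se in std_errors]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = sqrt(1.0 / total)
    p = erfc(abs(beta / se) / sqrt(2))  # two-sided normal P value
    return beta, se, p


def sample_size_weighted_z(z_scores, sample_sizes):
    """Combine signed Z scores with weights proportional to sqrt(sample size)."""
    weights = [sqrt(n) for n in sample_sizes]
    z = sum(w * zi for w, zi in zip(weights, z_scores)) / sqrt(sum(w * w for w in weights))
    return z, erfc(abs(z) / sqrt(2))


beta, se, p = fixed_effects_meta([0.02, 0.04], [0.01, 0.01])
z, p_z = sample_size_weighted_z([2.0, 2.0], [100, 100])
```

The sample-size-weighted form needs only the direction of effect and the P value from each study, which is convenient when effect sizes are not on a comparable scale.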

We were concerned that self-reported BW as adults in the UK Biobank would
not be comparable with that obtained from more stringent collection methods used
in other European ancestry GWAS. In addition, the UK Biobank lacked informa-
tion on gestational age for adjustment, which could have an impact on strength
of association compared with the results obtained from other European ancestry
GWAS. However, we observed no evidence of heterogeneity in BW allelic effects
at lead SNPs between the two components of European ancestry meta-analysis,
using Cochran's Q statistic implemented in GWAMA (ref. 39) after Bonferroni
correction (P > 0.00083) (Supplementary Table 3). We tested for heterogeneity in
allelic effects between studies within the European component using Cochran's Q.
At loci demonstrating evidence of heterogeneity, we confirmed that association
signals were not driven by outlying studies by visual inspection of forest plots.
We performed sensitivity analyses to assess the impact of covariate adjustment
(gestational age and population structure) on heterogeneity.
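
For two components, Cochran's Q reduces to a chi-square statistic with one degree of freedom. The following sketch computes Q and its P value for a single SNP and shows that the quoted threshold of 0.00083 is what a Bonferroni correction of 0.05 over 60 lead SNPs would give; that reading of the threshold is an inference from the numbers, not something stated explicitly in the text.

```python
from math import erfc, sqrt


def cochran_q_two_groups(beta_1, se_1, beta_2, se_2):
    """Cochran's Q and its P value (chi-square, 1 d.f.) for two effect estimates."""
    w1, w2 = 1.0 / se_1 ** 2, 1.0 / se_2 ** 2
    pooled = (w1 * beta_1 + w2 * beta_2) / (w1 + w2)
    q = w1 * (beta_1 - pooled) ** 2 + w2 * (beta_2 - pooled) ** 2
    p = erfc(sqrt(q / 2))  # survival function of chi-square with 1 d.f.
    return q, p


bonferroni_threshold = 0.05 / 60  # assumed: 60 lead SNPs tested

q, p = cochran_q_two_groups(0.02, 0.01, 0.04, 0.01)
heterogeneous = p < bonferroni_threshold
```

With more than two studies the same weighted sum of squared deviations applies, but the reference distribution has one fewer degree of freedom than the number of studies, which requires a general chi-square survival function.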

We were also concerned that overlap of individuals (duplicated or related)
between the two components of the European ancestry meta-analysis might lead
to false positive association signals. We performed bivariate linkage disequilibrium
score regression (ref. 8) using the two components of the European ancestry meta-analysis
and observed a genetic covariance intercept of 0.0156 ± 0.0058 (s.e.), indicating
a maximum of 1,119 duplicate individuals. Univariate linkage-disequilibrium
score regression of the European ancestry meta-analysis estimated the intercept
as 1.0426, which may indicate population structure or relatedness that was not
adequately accounted for in the analysis. To assess the impact of this inflation on
the European ancestry meta-analysis, we expanded the standard errors of BW
allelic effect size estimates and re-calculated association P values. On the basis of
this adjusted analysis, only the lead SNP at MTNR1B dropped below genome-wide
significance (rs10830963, P = 5.5 × 10-*).
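
Both numerical steps in this paragraph can be sketched. In bivariate linkage-disequilibrium score regression, the covariance intercept is approximately the number of overlapping samples multiplied by their phenotypic correlation and divided by the square root of the product of the two sample sizes; taking that correlation as 1 (an assumption appropriate to exact duplicates) reproduces the figure quoted above from the two component sizes of 75,891 and 67,786. The standard-error expansion is assumed here to be by the square root of the univariate intercept, which is common practice but is not stated in the text.

```python
from math import erfc, sqrt


def overlapping_samples(cov_intercept, n_1, n_2, phenotypic_correlation=1.0):
    """Number of shared samples implied by a bivariate LD score regression intercept."""
    return cov_intercept * sqrt(n_1 * n_2) / phenotypic_correlation


def inflate_and_retest(beta, se, ldsc_intercept):
    """Expand a standard error by sqrt(intercept) and recompute the two-sided P value."""
    adjusted_se = se * sqrt(ldsc_intercept)
    p = erfc(abs(beta / adjusted_se) / sqrt(2))
    return adjusted_se, p


overlap = overlapping_samples(0.0156, 75_891, 67_786)
print(round(overlap))

# Hypothetical effect estimate, used only to show the direction of the change.
adjusted_se, adjusted_p = inflate_and_retest(beta=0.02, se=0.01, ldsc_intercept=1.0426)
```

Because the adjustment only widens standard errors, it can move a borderline signal across the genome-wide significance threshold, as described for the MTNR1B lead SNP.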

Trans-ancestry meta-analysis. The trans-ancestry meta-analysis combined the
two European ancestry components with an additional 10,104 individuals from
six GWAS from diverse ancestry groups: African American, Chinese, Filipino,
Surinamese, Turkish and Moroccan. Within each GWAS, we first combined
sex-specific BW association summary statistics in a fixed-effects meta-analysis,
implemented in GWAMA (ref. 39) and applied a second round of genomic
control. Subsequently, we combined association summary statistics from the
six non-European GWAS and the two European ancestry components in a trans-
ancestry fixed-effects meta-analysis, implemented in GWAMA (ref. 39). Variants
failing GWAS quality control filters in the UK Biobank, reported in less than 50%
of the total sample size in the first component, or with MAF <0.1%, were excluded
from the trans-ancestry meta-analysis. We tested for heterogeneity in allelic effects
between ancestries using Cochran's Q (ref. 41).

Approximate conditional analysis. We searched for multiple distinct BW 
association signals in each of the established and novel loci, defined as 1 Mb 
up- and down-stream of the lead SNP from the trans-ancestry meta-analysis, 
through approximate conditional analysis. We applied GCTA (ref. 42) to identify 
index SNPs for distinct association signals attaining genome-wide significance
(P < 5 × 10⁻⁸) in the European ancestry meta-analysis using a reference sample
of 5,000 individuals of white British origin, randomly selected from the UK
Biobank, to approximate patterns of linkage disequilibrium between variants in 
these regions. Note that we performed approximate conditioning on the basis of 
only the European ancestry meta-analysis because GCTA cannot accommodate 
linkage-disequilibrium variation between diverse populations. 

Prioritizing candidate genes in each BW locus. We combined a number of 
approaches to prioritize the most likely candidate gene(s) in each BW locus. 

Expression quantitative trait loci (eQTLs) were obtained from the Genotype
Tissue Expression (GTEx) Project, the GEUVADIS project and eleven other
studies using HaploReg v4 (ref. 56). We interrogated coding variants for each
BW lead SNP and its proxies (EUR R² > 0.8) using Ensembl and HaploReg. Their
likely functional consequences were predicted by SIFT (ref. 58) and PolyPhen2
(ref. 59). Biological candidacy was assessed by presence in significantly enriched
gene set pathways from MAGENTA analyses (see below for details). We extracted
all genes within 300 kb of all lead BW SNPs and searched for connectivity between
any genes using STRING (ref. 60). If two or more genes between two separate BW
loci were connected, they were given an increased prior for both being plausible
candidates. We also applied protein-protein interaction (PPI) analysis (see below
for details) to all genes within 300 kb of each lead BW SNP and ranked the genes
based on the score for connectivity with the surrounding genes.

Evaluation of imputation quality of the low-frequency variant at the YKT6-
GCK locus. At the YKT6-GCK locus, the lead SNP (rs138715366) was found at
a low frequency in European ancestry populations (MAF = 0.92%) and was even
rarer in other ancestry groups (MAF = 0.23% in African Americans, otherwise
monomorphic) and was not present in the HapMap reference panel. To assess the
accuracy of imputation for this low-frequency variant, we genotyped rs138715366
in the Northern Finland Birth Cohort (NFBC) 1966 (Supplementary Table 1). Of
the 5,009 samples in the study, 4,704 were successfully imputed and genotyped
(or sequenced) for rs138715366. The overall concordance rate between imputed
and directly assayed genotypes was 99.8% and for directly assayed heterozygote
calls was 75.0%.

Fine-mapping analyses. We investigated linkage-disequilibrium differences
between populations contributing to the trans-ancestry meta-analysis, and took
advantage of the improved coverage of common and low-frequency variation
offered by 1000G or 1000G and UK10K combined imputation, to localize variants
driving each distinct association signal achieving locus-wide significance. For
each distinct signal, we used MANTRA (ref. 62) to construct 99% credible sets of
variants that together account for 99% of the posterior probability of driving the
association. MANTRA incorporates a prior model of relatedness between studies,
based on mean pair-wise allele frequency differences across loci, to account for
heterogeneity in allelic effects (Supplementary Table 3). MANTRA has been
demonstrated, by simulation, to improve localization of causal variants compared
with either a fixed- or random-effects trans-ancestry meta-analysis.

For loci with only one signal of association, we used MANTRA to combine
summary statistics from the six non-European GWAS and the two European
ancestry components. However, for loci with multiple distinct association signals,
we used MANTRA to combine summary statistics from approximate conditioning
for the two European components, separately for each signal.

For each distinct signal, we calculated the posterior probability that the jth
variant, π_Cj, is driving the association, given by

$$\pi_{Cj} = \frac{\Lambda_j}{\sum_k \Lambda_k}$$

where the summation is over all variants mapping within the (conditional)
meta-analysis across the locus. In this expression, Λ_j is the Bayes' factor in
favour of association from the MANTRA analysis. A 99% credible set was then
constructed by: (i) ranking all variants according to their Bayes' factor; and
(ii) including ranked variants until their cumulative posterior probability
exceeds 0.99.
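
The credible-set construction described above can be sketched in a few lines: each variant's posterior probability is its Bayes' factor divided by the sum of Bayes' factors across the locus, and variants are added in decreasing order until the cumulative probability reaches the target coverage. The variant identifiers and Bayes' factors below are hypothetical, and the input is assumed to be on the natural scale (a log10 Bayes' factor would need to be exponentiated first).

```python
def credible_set(bayes_factors, coverage=0.99):
    """Return per-variant posterior probabilities and the credible set.

    bayes_factors maps a variant identifier to its Bayes' factor in favour
    of association (natural scale, not log-transformed).
    """
    total = sum(bayes_factors.values())
    posteriors = {variant: bf / total for variant, bf in bayes_factors.items()}
    # Rank variants from highest to lowest posterior probability.
    ranked = sorted(posteriors, key=posteriors.get, reverse=True)
    selected, cumulative = [], 0.0
    for variant in ranked:
        selected.append(variant)
        cumulative += posteriors[variant]
        if cumulative >= coverage:
            break
    return posteriors, selected


# Hypothetical locus with three variants.
posteriors, selected = credible_set({"rsA": 900.0, "rsB": 95.0, "rsC": 5.0})
print(selected)
```

Smaller credible sets indicate better resolution: here the two top-ranked variants together account for 99.5% of the posterior probability, so the third is left out.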

Genomic annotation. We used genomic annotations of DNase I hypersensitive
sites (DHS) from the ENCODE (ref. 65) project and protein coding genes from
GENCODE (ref. 66). We filtered out cell types that are cancer cell lines (karyotype
‘cancer’ from https://genome.ucsc.edu/ENCODE/cellTypes.html), and merged data
from multiple samples from the same cell type. This resulted in 128 DHS cell-type
annotations, as well as 4 gene-based annotations (coding exon, 5′ UTR, 3′ UTR and
1 kb upstream of the transcription start site (TSS)). First, we tested for the effect of
each cell type DHS and gene annotation individually using the Bayes' factors for
all variants in the 62 credible sets using fgwas (ref. 67). Second, we categorized the
annotations into ‘genic’, ‘fetal DHS’, ‘embryonic DHS’, ‘stem cell DHS’, ‘neonatal
DHS’ and ‘adult DHS’ based on the description fields from ENCODE, and tested
for the effect of each category individually as described above using fgwas. Third,
we then tested the effect of each category by including all categories in a joint model
using fgwas. For each of the three analyses, we obtained the estimated effects and
95% confidence intervals (CI) for each annotation, and considered an annotation
enriched if the 95% CI did not overlap zero.

Estimation of genetic variance explained. The 'variance explained' statistic 
was calculated using the REML method implemented in GCTA (ref. 68). We 
considered the variance explained by two sets of SNPs: (i) lead SNPs of all 62 
distinct association signals at the 59 established and novel autosomal BW loci 
identified in the European-specific or trans-ancestry meta-analyses; (ii) lead SNPs 
of 55 distinct association signals at the 52 novel autosomal BW loci (Extended 
Data Table 1 and Supplementary Table 7). The 'variance explained' was calculated 
in samples of European ancestry in the Hyperglycemia and Adverse Pregnancy 
Outcome (HAPO) study (independent of the meta-analysis) and two studies 
that were part of the European ancestry meta-analysis: NFBC1966 and Generation 
R (Supplementary Table 1). In each study, the genetic relationship matrix was 
estimated for each set of SNPs and was tested individually against BW (males 
and females combined) with study specific covariates. These analyses provided 
an estimate and s.e. for the variance explained by each of the given sets of SNPs. 

Examining the relative effects on BW of maternal and fetal genotype at the 60 
identified loci. We performed four sets of analyses. First, we used GWAS data 
from 4,382 mother-child pairs in the Avon Longitudinal Study of Parents and 
Children (ALSPAC) study to fit a 'maternal-GCTA model' to estimate the extent 
to which the maternal genome might influence offspring BW independent of the 
fetal genome. The maternal-GCTA model uses genome-wide genetic similarity 
between mothers and offspring to partition the phenotypic variance in BW into 
components due to the maternal genotype, the child's genotype, the covariance 
between the two and environmental sources of variation. 

Second, we compared associations with BW of the fetal versus maternal 
genotype at each of the 60 BW loci. The maternal allelic effect on offspring BW 
was obtained from a maternal GWAS meta-analysis of 68,254 European mothers 
from the EGG Consortium (n = 19,626) and the UK Biobank (n = 48,628). In 
the UK Biobank, mothers were asked to report the BW of their first child. Women 
of European ancestry with genotype data available in the May 2015 data release 
were included, and those with reported BW equivalent to <2.5 kg or >4.5 kg were 
excluded. No information on gestational age or gender of child was available. 
BW of first child was associated with maternal factors such as smoking status, 
BMI and height in the expected directions. Of the 68,254 women included in the 
maternal GWAS, 13% were mothers of individuals included in the current fetal 
European ancestry GWAS, and a further ~45% were themselves (with their own 
BW) included in the fetal GWAS. 

Third, we additionally conducted analyses in 12,909 mother-child pairs from 
nine contributing studies: at each of the 60 loci, we compared the effect of the fetal 
genotype on BW adjusted for sex and gestational age, with and without adjustment 
for maternal genotype. We reciprocally compared the association between 
the maternal genotype and BW with and without adjustment for fetal genotype. 

Fourth, we used the method of Zhang et al. to test associations between BW 
and the maternal untransmitted, maternal transmitted and inferred paternal 
transmitted haplotype score of 422 height SNPs, 30 SBP SNPs and 84 T2D 
SNPs in 5,201 mother-child pairs from the ALSPAC study. 

Linkage-disequilibrium score regression. The use of linkage-disequilibrium 
score regression to estimate the genetic correlation between two traits/diseases 
has been described in detail elsewhere. Briefly, the linkage-disequilibrium score 
is a measure of how much genetic variation each variant tags; if a variant has a 
high linkage-disequilibrium score then it is in high linkage disequilibrium with 
many nearby polymorphisms. Variants with high linkage-disequilibrium scores 
are more likely to contain more true signals and hence provide more chance of 
overlap with genuine signals between GWAS. The linkage-disequilibrium score 
regression method uses summary statistics from the GWAS meta-analysis of BW 
and the other traits of interest, calculates the cross-product of test statistics at each 
SNP, and then regresses the cross-product on the linkage-disequilibrium score. 
Bulik-Sullivan et al. show that the slope of the regression is a function of the 
genetic covariance between traits: 

$$E[z_{1j} z_{2j}] = \frac{\sqrt{N_1 N_2}\,\rho_g}{M}\,\ell_j + \frac{\rho N_s}{\sqrt{N_1 N_2}}$$

where $z_{ij}$ is the test statistic for SNP j in study i, $N_i$ is the sample size 
for study i, $\rho_g$ is the genetic covariance, $M$ is the number 
of SNPs in the reference panel with MAF between 5% and 50%, $\ell_j$ is the linkage- 
disequilibrium score for SNP j, $N_s$ quantifies the number of individuals that overlap 
both studies, and $\rho$ is the phenotypic correlation amongst the $N_s$ overlapping 
samples. Thus, if there is sample overlap (or cryptic relatedness between samples), 
it will only affect the intercept from the regression (that is, the term 
$\rho N_s / \sqrt{N_1 N_2}$) and 
not the slope, and hence estimates of the genetic covariance will not be biased by 
sample overlap. Likewise, population stratification will affect the intercept but will 
have minimal impact on the slope (that is, intuitively since population stratification 
does not correlate with linkage disequilibrium between nearby markers). 
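
To make the slope and intercept interpretation concrete, the following is a minimal sketch of the regression step in Python. It is a simplification and not the ldsc implementation: it uses unweighted ordinary least squares, whereas the published software uses a weighted regression with jackknife standard errors. The function names and the toy numbers are assumptions made for illustration.

```python
import math
from typing import Sequence, Tuple


def ols(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least squares fit of y on x; returns (slope, intercept)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def genetic_covariance(z1: Sequence[float], z2: Sequence[float],
                       ld_scores: Sequence[float],
                       n1: int, n2: int, m: int) -> Tuple[float, float]:
    """Regress z1*z2 on the LD score and rescale the slope.

    slope = sqrt(n1 * n2) * rho_g / m, so rho_g = slope * m / sqrt(n1 * n2).
    The intercept absorbs sample overlap and is returned unchanged.
    """
    products = [a * b for a, b in zip(z1, z2)]
    slope, intercept = ols(ld_scores, products)
    rho_g = slope * m / math.sqrt(n1 * n2)
    return rho_g, intercept


# Toy data built so that z1*z2 = 0.5 * ld + 0.1 exactly (illustrative only).
ld = [1.0, 2.0, 3.0, 4.0]
z_a = [0.6, 1.1, 1.6, 2.1]
z_b = [1.0, 1.0, 1.0, 1.0]
rho_g_hat, intercept_hat = genetic_covariance(z_a, z_b, ld, 10000, 10000, 1000)
```

With these toy inputs the slope of 0.5 rescales to a genetic covariance of 0.05, while the intercept of 0.1 is left out of the estimate, which is the property the text relies on when samples overlap.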
Summary statistics from the GWAS meta-analysis for traits and diseases of 
interest were downloaded from the relevant consortium website. The summary 
statistics files were reformatted for linkage-disequilibrium score regression 
analysis using the munge_sumstats.py python script provided on the developer's 
website (https://github.com/bulik/ldsc). For each trait, we filtered the summary 
statistics to the subset of HapMap 3 SNPs, as advised by the developers, to 
ensure that no bias was introduced due to poor imputation quality. Summary 
statistics from the European-specific BW meta-analysis were used because of the 
variable linkage-disequilibrium structure between ancestry groups. Where the 
sample size for each SNP was included in the results file this was flagged using 
N-col; if no sample size was available then the maximum sample size reported in 
the reference for the GWAS meta-analysis was used. SNPs were excluded for the 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


following reasons: MAF < 0.01; ambiguous strand; duplicate rsID; non-autosomal 
SNPs; reported sample size less than 60% of the total available. Once all files were 
reformatted, we used the ldsc.py python script, also on the developers' website, to 
calculate the genetic correlation between BW and each of the traits and diseases. 
The European linkage-disequilibrium score files calculated from the 1000G 
reference panel and provided by the developers were used for the analysis. Where 
multiple GWAS meta-analyses had been conducted on the same phenotype (that 
is, over a period of years), the genetic correlation with BW was estimated using 
each set of summary statistics and presented in Supplementary Table 12. The 
phenotypes with multiple GWAS included height, BMI, waist-hip ratio (adjusted 
for BMI), total cholesterol, triglycerides, high density lipoprotein (HDL) and low 
density lipoprotein (LDL). The estimates of the genetic correlation from the 
multiple GWAS meta-analyses on the same phenotype were comparable and the 
later GWAS had a smaller standard error due to the increased sample size, so 
only the genetic correlations between BW and the most recent meta-analyses were 
presented in Fig. 2. 

In the published GWAS for blood pressure the phenotype was adjusted for 
BMI. Caution is needed when interpreting the genetic correlation between BW 
and BMI-adjusted SBP owing to the potential for collider bias. Since BMI is 
associated with both blood pressure and BW, it is possible that the use of a blood 
pressure genetic score adjusted for BMI might bias the genetic correlation estimate 
towards a more negative value. To verify that the inverse genetic correlation with 
BW (Rg = −0.26, s.e. = 0.05, P = 6.5 x 10-®) was not due to collider bias caused by 
the BMI adjustment of the phenotype, we obtained an alternative estimate using 
UK Biobank GWAS data for SBP that was unadjusted for BMI and obtained a 
similar result (Rg = −0.22, s.e. = 0.03, P = 5.5 x 10). The SBP phenotype in the 
UK Biobank was prepared as follows. Two blood pressure readings were taken at 
assessment, approximately 5 min apart. We included all individuals with an 
automated blood pressure reading (taken using an automated Omron blood pressure 
monitor). Two valid measurements were available for most participants (averaged 
to create a blood pressure variable, or alternatively a single reading was used if only 
one was available). Individuals were excluded if the two readings differed by more 
than 4.56 s.d. Blood pressure measurements more than 4.56 s.d. away from the 
mean were excluded. We accounted for blood pressure medication use by adding 
15 mm Hg to the SBP measure. Blood pressure was adjusted for age, sex and centre 
location and then inverse rank normalized. We performed the GWAS on 127,698 
individuals of British descent using BOLT-LMM (ref. 37), with genotyping array 
as covariate. 

Estimating the proportion of the BW-adult traits covariance attributable to 
genotyped SNPs. We estimated the phenotypic, genetic and residual correlations 
as well as the genetic and residual covariance between BW and several quantitative 
traits and/or disease outcomes in the UK Biobank using directly genotyped SNPs 
and the REML method implemented in BOLT-LMM (ref. 37). The traits examined 
included T2D, SBP, diastolic blood pressure, CAD, height, BMI, weight, waist-hip 
ratio, hip circumference, waist circumference, obesity, overweight, age at menarche, 
asthma, and smoking. Where phenotypes were not available (for example, serum 
blood measures are not currently available in the UK Biobank), we obtained 
estimates using the NFBC1966 study (for correlations/covariance between BW and 
triglycerides, total cholesterol, HDL, LDL, fasting glucose and fasting insulin). In 
the UK Biobank analysis, we used 57,715 unrelated individuals with BW available 
and identified by the UK Biobank as white British. SNPs with evidence of deviation 
from Hardy-Weinberg equilibrium (P < 1 x 10-®), MAF < 0.05 or overall missing 
rate >0.015 were excluded, resulting in 328,928 SNPs for analysis. We included the 
first five ancestry principal components as covariates. In the NFBC1966 analysis, 
5,009 individuals with BW were enrolled. Genotyped SNPs that passed quality 
control (Supplementary Table 2) were included, resulting in 324,895 SNPs for 
analysis. The first three ancestry principal components and sex were included 
as covariates. 


Gene set enrichment analysis. Meta-analysis gene-set enrichment of variant 
associations (MAGENTA) was used to explore pathway-based associations using 
summary statistics from the trans-ancestry meta-analysis. MAGENTA implements 
a gene set enrichment analysis (GSEA) based approach, as previously described. 
Briefly, each gene in the genome was mapped to a single index SNP with the 
lowest P value within a 110 kb upstream and 40 kb downstream window. This 
P value, representing a gene score, was then corrected for confounding factors 
such as gene size, SNP density and linkage-disequilibrium-related properties in a 
regression model. Genes within the HLA-region were excluded from analysis due 
to difficulties in accounting for gene density and linkage-disequilibrium patterns. 
Each mapped gene in the genome was then ranked by its adjusted gene score. At 
a given significance threshold (95th and 75th percentiles of all gene scores), the 
observed number of gene scores in a given pathway, with a ranked score above 
the specified threshold percentile, was calculated. This observed statistic was 
then compared to 1,000,000 randomly permuted pathways of identical size. This 
generates an empirical GSEA P value for each pathway. Significance was attained 
when an individual pathway reached an FDR < 0.05 in either analysis. In total, 3,216 
pre-defined biological pathways from Gene Ontology, PANTHER, KEGG and 
Ingenuity were tested for enrichment of multiple modest associations with BW. 
The MAGENTA software was also used for enrichment testing of custom gene sets. 
Protein-protein interaction network analyses. We used the integrative 
protein-interaction-network-based pathway analysis (iPINBPA) method. Briefly, 
we generated gene-wise P values from the trans-ancestry meta-analysis using 
VEGAS2 (ref. 74), which mapped the SNPs to genes and accounted for possible 
confounders, such as linkage-disequilibrium between markers. The empirical 
gene-wise P values were calculated using simulations from the multivariate normal 
distribution. Those that were nominally significant (P < 0.01) were selected as 
'seed genes', and were collated within a high confidence version of inweb3 
(ref. 75) to weight the nodes in the network following a guilt-by-association 
approach. In a second step, a network score was defined by the combination of 
the Z scores derived from the gene-wise P values with node weights using the 
Liptak-Stouffer method. A heuristic algorithm was then applied to extensively 
search for modules enriched in genes with low P values. The modules were 
further normalized using a null distribution of 10,000 random networks. Only 
those modules with Z score > 5 were selected. Finally, the union of all modules 
constructed a BW-overall PPI network. Both the proteins on the individual 
modules and on the overall BW-PPI were interrogated for enrichment in Gene 
Ontology terms (biological processes) using a hypergeometric test. Terms were 
considered as significant when the adjusted P value, following the 
Benjamini-Hochberg procedure, was below 0.05. 
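
As an illustration of the score combination mentioned above, the sketch below shows the generic weighted (Liptak-Stouffer) Z method in Python. How iPINBPA derives its node weights and searches for modules is not reproduced here; the function names, the one-sided P-to-Z conversion and the example values are assumptions for illustration.

```python
import math
from statistics import NormalDist
from typing import Sequence


def p_to_z(p_value: float) -> float:
    """Convert a one-sided P value to a Z score (small P gives large positive Z)."""
    return -NormalDist().inv_cdf(p_value)


def liptak_stouffer(z_scores: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted Z combination: sum(w_i * z_i) / sqrt(sum(w_i ** 2))."""
    numerator = sum(w * z for w, z in zip(weights, z_scores))
    denominator = math.sqrt(sum(w * w for w in weights))
    return numerator / denominator


# Three hypothetical genes in a module, with made-up P values and node weights.
gene_p_values = [0.001, 0.01, 0.2]
node_weights = [1.0, 0.8, 0.3]
module_z = liptak_stouffer([p_to_z(p) for p in gene_p_values], node_weights)
```

Under the null hypothesis, and with independent inputs, the combined statistic is again standard normal, which is what allows module scores to be compared against a null distribution of random networks.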

Point of contact analyses. The same methodology described above was applied 
to 16 different adult traits resulting in a number of enriched modules per trait. 
Different modules for each trait were combined in a single component and the 
intersection between these trait-specific components and the BW component 
was calculated. This intersection was defined as the PoC network. We used the 
resulting PoC networks in downstream analyses to interrogate which set of proteins 
connected BW variation and adult trait variation via pathways enriched in the 
overall BW analysis. 

Parent-of-origin specific associations. We first searched for evidence of 
parent-of-origin effects in the UK Biobank samples by comparing variance between 
heterozygotes and homozygotes using Quicktest (ref. 77). In this analysis, we 
used only unrelated individuals identified genetically as of white British origin 
(n = 57,715). Principal components were generated using these individuals and the 
first five were used to adjust for population structure as covariates in the analysis, 
in addition to a binary indicator for genotyping array. 

We also examined 4,908 mother-child pairs in ALSPAC and determined the 
parental origin of the alleles where possible. Briefly, the method used mother- 
child pairs to determine the parent of origin of each allele. For example, if the 
mother/child genotypes were AA/Aa, the child's maternal/paternal allele 
combination was A/a. For the situation where both mother and child were heterozygous, 
the child's maternal/paternal alleles could not be directly specified. However, the 
parental origin of the alleles could be determined by phasing the genotype data 
and comparing maternal and child haplotypes. We then tested these alleles for 
association with BW adjusting for sex and gestational age. 
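
The genotype-based part of this rule can be written down directly. The sketch below is a hypothetical helper, not the authors' pipeline: it handles a single biallelic SNP, represents genotypes as two-character strings such as "Aa", and returns None for the doubly heterozygous case that the text resolves by phasing (which is not implemented here).

```python
from typing import Optional, Tuple


def split_child_alleles(mother: str, child: str) -> Optional[Tuple[str, str]]:
    """Return the child's (maternal allele, paternal allele), or None.

    None means the origin cannot be assigned from unphased genotypes alone
    (mother and child both heterozygous) or the pair is Mendelian-inconsistent.
    """
    if len(mother) != 2 or len(child) != 2:
        raise ValueError("genotypes must be two-character strings such as 'Aa'")
    maternal_pool = set(mother)
    first, second = child[0], child[1]
    if first == second:
        # Homozygous child: both parents transmitted the same allele.
        return (first, first) if first in maternal_pool else None
    if maternal_pool == {first, second}:
        # Mother and child both heterozygous: ambiguous without phasing.
        return None
    if first in maternal_pool and second not in maternal_pool:
        return (first, second)
    if second in maternal_pool and first not in maternal_pool:
        return (second, first)
    return None  # neither child allele is carried by the mother


example_origin = split_child_alleles("AA", "Aa")
```

For the AA/Aa example given in the text, the helper assigns A as the maternally transmitted allele and a as the paternally transmitted one.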

Statistical power in these currently available sample sizes was insufficient to rule 
out widespread parent-of-origin effects across the regions tested. Using the mean β 
(0.034 s.d.) and MAF (0.28) of the identified loci, we estimate that we would need 
at least 200,000 unrelated individuals or 70,000 mother-child pairs for 80% power 
to detect parent-of-origin effects at P < 0.00085. 

Hierarchical clustering of BW loci. To explore the different patterns of association 
between BW and other anthropometric/metabolic/endocrine traits and diseases, 
we performed hierarchical clustering analysis. The lead SNP (or proxy, EUR 
R² > 0.6) at the 60 BW loci was queried in publicly available GWAS meta- 
analysis datasets or in GWAS results obtained through collaboration. Results 
were available for 53 of those loci and the extracted Z score (allelic effect/s.e., 
Supplementary Table 17) was aligned to the BW-raising allele. We performed 
two dimensional clustering by trait and by locus. We computed the Euclidean 
distance amongst Z scores of the extracted traits and loci and performed complete 
hierarchical clustering implemented in the pvclust package 
(http://www.sigmath.es.osaka-u.ac.jp/shimo-lab/prog/pvclust/) in R v3.2.0 (http://www.R-project.org/). 
Clustering uncertainty was measured by multiscale bootstrap resampling estimated 
from 1,000 replicates. We used α = 0.05 to define distinct clusters and, based on 
the bootstrap analysis, calculated the Calinski index to identify the number of 
well-supported clusters (cascadeKM function, vegan package, 
http://CRAN.R-project.org/package=vegan). Clustering was visualized by constructing 
dendrograms and a heat map. 
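
For readers unfamiliar with complete-linkage clustering on Euclidean distances, here is a small pure-Python sketch of the agglomeration step only. It does not reproduce pvclust: there is no multiscale bootstrap and no Calinski index, and the locus names and Z-score profiles are made up for illustration.

```python
import math
from typing import Dict, List, Sequence, Tuple

Cluster = Tuple[str, ...]


def euclidean(u: Sequence[float], v: Sequence[float]) -> float:
    """Euclidean distance between two Z-score profiles."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def complete_linkage(profiles: Dict[str, Sequence[float]]
                     ) -> List[Tuple[Cluster, Cluster, float]]:
    """Agglomerative clustering; returns the merges in the order they occur.

    The distance between two clusters is the largest pairwise distance
    between their members (complete linkage).
    """
    clusters: List[Cluster] = [(name,) for name in profiles]
    merges: List[Tuple[Cluster, Cluster, float]] = []

    def cluster_distance(c1: Cluster, c2: Cluster) -> float:
        return max(euclidean(profiles[a], profiles[b]) for a in c1 for b in c2)

    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = cluster_distance(clusters[i], clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return merges


# Hypothetical Z scores for three loci across two traits (illustrative only).
toy_profiles = {"locusA": [0.0, 0.0], "locusB": [0.0, 1.0], "locusC": [5.0, 5.0]}
toy_merges = complete_linkage(toy_profiles)
```

The two loci with similar profiles are joined first, and the outlying locus joins last at the largest pairwise distance, which is the height at which it would appear in a dendrogram.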

Separately from the hierarchical clustering analysis, we queried the lead SNP 
at EPAS1 in a GWAS of haematological traits because variation at that locus has 
previously been implicated in BW and adaptation to hypoxia at high altitudes in 
Tibetans (Supplementary Table 17). 

Extended Data Figure 1 | Flow chart of the study design. 

Extended Data Figure 2 | Manhattan and quantile-quantile (QQ) plots 
of the trans-ancestry meta-analysis for BW. a, Manhattan (main panel) 
and QQ (top right) plots of genome-wide association results for BW from 
trans-ancestry meta-analysis of up to 153,781 individuals. The association 
P value (on −log10 scale) for each of up to 22,434,434 SNPs (y axis) was 
plotted against the genomic position (NCBI Build 37; x axis). Association 
signals that reached genome-wide significance (P < 5 × 10^-8) are shown 
in green if novel and pink if previously reported. In the QQ plot, the black 
dots represent observed P values and the grey line represents expected 
P values under the null distribution. The red dots represent observed 
P values after excluding the previously identified signals. b, Manhattan 
(main panel) and QQ (top right) plots of trans-ethnic GWAS meta- 
analysis for BW highlighting the reported imprinted regions described in 
Supplementary Table 14. Novel association signals that reached genome- 
wide significance (P < 5 × 10^-8) and mapped to imprinted regions are 
shown in green. Genomic regions outside imprinted regions are shaded 
in grey. SNPs in the imprinted regions are shown in light blue or dark 
blue, depending on chromosome number (odd or even). In the QQ plot, 
the black dots represent observed P values and the grey lines represent 
expected P values and their 95% confidence intervals under the null 
distribution for the SNPs within the imprinted regions. 

Extended Data Figure 3 | Regional plots for multiple distinct signals 
at three BW loci. Regional plots for each locus, ZBTB7B (a), HMGA1 
(b) and PTCH1 (c), are displayed from: the unconditional European- 
specific meta-analysis of up to 143,677 individuals (left); the approximate 
conditional meta-analysis for the primary signal after adjustment for the 
index variant for the secondary signal (middle); and the approximate 
conditional meta-analysis for the secondary signal after adjustment for the 
index variant for the primary signal (right). Directly genotyped or imputed 
SNPs were plotted with their association P values (on a −log10 scale) as a 
function of genomic position (NCBI Build 37). Estimated recombination 
rates (blue lines) were plotted to reflect the local linkage-disequilibrium 
structure around the index SNPs and their correlated proxies. SNPs were 
coloured in reference to linkage-disequilibrium with the particular index 
SNP according to a blue to red scale from R² = 0 to 1, based on pairwise 
R² values estimated from a reference of 5,000 individuals of white British 
origin, randomly selected from the UK Biobank. 

Extended Data Figure 4 | Comparison of fetal effect sizes and maternal 
effect sizes at 60 known and novel birth weight loci, for the first 24 
loci. The remaining loci are shown in Extended Data Fig. 5a. For each 
BW locus, the following six effect sizes (with 95% CI) are shown, all 
aligned to the same BW-raising allele: fetal_GWAS, fetal allelic effect 
on BW (from European ancestry meta-analysis of up to n = 143,677 
individuals); fetal_unadjusted, fetal allelic effect on BW (unconditioned 
in n = 12,909 mother-child pairs); fetal_adjusted, fetal effect (conditioned 
on maternal genotype, n = 12,909); maternal_GWAS, maternal allelic 
effect on offspring BW (from meta-analysis of up to n = 68,254 European 
mothers); maternal_unadjusted, maternal allelic effect on offspring 
BW (unconditioned, n = 12,909); maternal_adjusted, maternal effect 
(conditioned on fetal genotype, n = 12,909). The 60 BW loci were ordered 
by chromosome and position (Supplementary Tables 10, 11). These 
plots illustrate that, in large GWAS of BW, fetal effect size estimates are 
larger than those of maternal at 55 out of 60 identified loci (binomial 
P = 1 × 10^-11), suggesting that most of the associations are driven by the 
fetal genotype. In conditional analyses that modelled the effects of both 
maternal and fetal genotypes (n = 12,909 mother-child pairs), confidence 
intervals around the estimates were wide, precluding inference about the 
likely contribution of maternal versus fetal genotype at individual loci. 

Extended Data Figure 5 | Comparison of fetal effect sizes and maternal effect sizes at 60 known and novel birth weight loci, for the remaining 36 
loci. a, Continued from Extended Data Fig. 4. b, The scatter plot illustrates the difference between the fetal (x axis) and maternal (y axis) effect sizes in 
the overall maternal versus fetal GWAS results. 

Extended Data Figure 6 | Protein-protein interaction (PPI) network 
analysis. a, The largest global component of BW PPI network containing 
13 modules is shown. b, The histogram shows the null distribution of 
Z scores of BW PPI networks based on 10,000 random networks, and 
where the Z scores for the 13 BW modules (M1-13) lie. For each module, 
the two most significant GO terms are shown. c, A heat map is shown, 
which takes the top 50 biological processes over-represented in the global 
BW PPI network (listed at the right of the plot), and displays the extent 
of enrichment for the various trait-specific 'point of contact' (PoC) PPI 
networks. d, e, Trait-specific PoC PPI networks composed of proteins that 
are shared in both the global BW PPI network and networks generated 
using the same pipeline for each of the adult traits: d, canonical Wnt 
signalling pathway enriched for PoC PPI between BW and blood pressure 
(BP)-related phenotypes; and e, regulation of insulin secretion pathway 
enriched for PoC between BW and T2D/fasting glucose (FG). Red nodes 
indicate those present in PoC for BW and traits of interest; blue nodes 
correspond to the pathway nodes; purple nodes are those present in both 
the pathway and PoC; orange nodes are genes in BW loci that overlap 
with both the pathway and PoC. Large nodes correspond to genes in BW 
loci (within 300 kb from the lead SNP), and have a black border if they, 
amongst all BW loci, have a stronger (top 5) association with at least one 
of the pairing adult traits. 

Extended Data Figure 7 | Quantile-quantile (QQ) plots of variance 
comparison between heterozygotes and homozygotes analysis in 57,715 
UK Biobank samples and parent-of-origin specific analysis in 4,908 
ALSPAC mother-child pairs at 59 autosomal BW loci plus DLK1. a, QQ 
plot from the Quicktest analysis (ref. 77) comparing the BW variance of 
heterozygotes with homozygotes in 57,715 UK Biobank samples. b, QQ 
plot from the parent-of-origin specific analysis testing the association 
between BW and maternally transmitted versus paternally transmitted 
alleles in 4,908 mother-child pairs from the ALSPAC study (Methods, 
Supplementary Tables 15, 16). In both panels, the black dots represent lead 
SNPs at 59 identified autosomal BW loci and a further sub-genome-wide 
significant signal for BW near DLK1 (rs6575803; P = 5.6 × 10^-8). The grey 
lines represent expected P values and their 95% confidence intervals under 
the null distribution for the 60 SNPs. Both results show trends in favour of 
imprinting effects at BW loci; however, despite the large sample size, these 
analyses were underpowered (see Methods) and much larger sample sizes 
are required for definitive analysis. 

Extended Data Figure 8 | Summary of previously reported loci for SBP, 
CAD, T2D and adult height and their effect on birth weight. a-d, Effect 
sizes (left y axis) of previously reported 30 SBP loci, 45 CAD loci, 
84 T2D loci and 422 adult height loci were plotted against effects on 
BW (x axis). Effect sizes were aligned to the adult trait (or risk)-raising 
allele. The colour of each dot indicates BW association P value: red, 
P < 5 × 10^-8; orange, 5 × 10^-8 < P < 0.001; yellow, 0.001 < P < 0.01; white, 
P > 0.01. The superimposed grey frequency histogram shows the number 
of SNPs (right y axis) in each category of BW effect size. e, Effect sizes 
(with 95% CI) on BW of 45 known CAD loci were plotted arranged in the 
order of CAD effect size from highest to lowest, separating out the known 
SBP loci. CAD loci with a larger effect on BW concentrated amongst loci 
with primary blood pressure association. f, Effect sizes (with 95% CI) on 
BW of 32 known T2D loci were plotted, subdivided by previously reported 
categories derived from detailed adult physiological data. Heterogeneity 
in BW effect sizes between five T2D loci groups with different mechanistic 
categories was substantial (Cochran's Q statistic Phet = 1.2 * 10°®). In 
pairwise comparisons, the 'beta cell' group of variants differed from the 
other four groups: fasting hyperglycaemia (Phet = 3 « 10~"'), insulin 
resistance (Phet = 0.002), proinsulin (Phet = 0.78) and unclassified 
(Phet = 0.02) groups. All of the BW effect sizes plotted in the forest plots 
were aligned to the trait (or risk)-raising allele. 

Extended Data Table 1 | Sixty loci associated with BW (P < 5 × 10^-8) in European ancestry meta-analysis of up to 143,677 individuals 
and/or trans-ancestry meta-analysis of up to 153,781 individuals 
a, Effects (β values) were aligned to the BW-raising allele. Effect allele frequency (EAF) was obtained from the trans-ancestry meta-analysis, except for PLAC1, for which the EAF was obtained from 
the European ancestry meta-analysis due to lack of X chromosome data from the non-European studies. Chr, chromosome; bp, base pair; b37, build 37; EAF, effect allele frequency; SE, standard error. 
b, The effect of the lead SNP (absolute value of β, y axis) is given as a function of minor allele frequency (x axis) for 60 known (pink) and novel (green) BW loci from the trans-ancestry meta-analysis. 
Error bars are proportional to the standard error of the effect size. The dashed line indicates 80% power to detect association at genome-wide significance level for the sample size in trans-ancestry 
meta-analysis. 

Extended Data Table 2 | Gene set enrichment analysis and protein-protein interaction (PPI) analysis 
a. Gene set enrichment analysis 

Columns: Database; Gene set; Number of genes (mapped to MAGENTA); then, for the 95th percentile enrichment 
cutoff and for the 75th percentile enrichment cutoff in turn: P; FDR; Expected (observed) number of genes 
biosynthetic process 10 (10) 5.8x10° 0,005 1(5) 3.6x107 0.18 3(7) 
GOTERM __Insulin-Ike growth factor receptor binding 13 (13) 24x10 0.006 16) 002 035 = -3(7) 
GOTERM Positive regulation of glucose import 22 (22) 1.0x10* — 0.019 1(7) 0.02 0.36 6 (10) 
GOTERM Insulin receptor signalling pathway 35 (34) 2.8x10° 0.022 2(9) 4.3x10° 0.27 9(16) 
GOTERM ‘Chromatin remodelling complex 11 (9) 9.0x10* 0.036 0 (4) 0.16 0.55 2(4) 
KEGG Glycosphyngolipid biosynthesis. globo- 
series 14.(13) 2.6x10° 0,037 1 (4) 0.21 0.48 365) 
KEGG Melanoma 71 (67) 116x108 0.0378 (10) 0.05 0.35 17 (23) 
KEGG Terpenoid backbone biosynthesis, 15 (15) 5.9x10° 0.039 1(1) 015 0444 (6) 
KEGG ‘Type 2 Diabetes Mellitus 47 (45) 22x10° — 0.040 2(8) 0.14 0.46 = 11 (15) 
Panther Cholesterol biosynthesis 14(11) ‘ext0° 0.040 1 (4) 029° «064 = 3 (4) 
BIOCARTA — Growth hormone pathway 28 (27) B.0x10* 0.044 1(7) O11 © 0.25 7(10) 
KEGG ocyte meiosis 114(108) —1.0x10" 0.0485 (14) 007 04527 (84) 
Custom gene set of imprinted genes 
GTEX Imprinted genes (All) 77 (72) 1oxtoe 4(12) ont 18 (23) 
GTEX Imprinted genes (Primary) 38 (35) Bax10o 216) 0.14 - (12) 
GTEX Imprinted genes (Primary + Suggestive) 55 (50) 0.010 - 3(7) 0.25 : 13:(15) 


b. Protein-protein interaction analysis 

Columns: Database; Pathway; Number of genes (overlapped with PPI network); Z score; P; Adjusted P* 
GOTERM Epidermal growth factor receptor signalling pathway 198 (31) 7.97 3.ax10 14x10 
GOTERM _Insulin receptor signalling pathway 151 (26) 7.90 11x10" 29x10" 
GOTERM Stimulatory C-type lectin receptor signalling pathway 421 (22) 7.59 75x10" 1.2x10° 
GOTERM Negative regulation of canonical Wnt signalling pathway 152 (25) 7.46 62x10 12x10" 
GOTERM Notch signalling pathway 129 (22) 7.21 2.6x10° 3.3x10° 
GOTERM Cellular response to insulin stimulus 71(16) 7.62 3,7x102 41x10 
GOTERM Positive regulation of glycogen biosynthetic process 15 (8) 9.39 5.3x10° 5.1x10° 
GOTERM Positive regulation of protein phosphorylation 114 (20) 7.03 6.8x10° 5.9x10° 
GOTERM —_ Positive regulation of glucose import 27 (10) 8.42 83x10° 65x10" 
GOTERM __ Fe-epsilon receptor signalling pathway 186 (26) 6.58 96x10" __6.8x10" 


Two complementary analyses of the overall GWAS summary data identified enrichment of BW association in biological pathways related to metabolism, growth and development. a, The top results
(FDR < 0.05 at the 95th percentile enrichment threshold) from a total of 3,216 biological pathways tested for enrichment of multiple modest associations with BW. Additionally, results are shown for
custom sets of imprinted genes: Primary, genes identified as highly likely to be imprinted in the GTEx database (tested n = 28); Primary + suggestive, genes identified as highly likely and suggestively
imprinted in GTEx (n = 58); All, the above plus genes selected from the literature where imprinting status is consistent in GTEx (n = 77). b, The results of a complementary analysis of empirical PPI data,
displaying the top 10 most significant pathways enriched for BW-association score.

*P value is adjusted for multiple testing using the Benjamini-Hochberg method.
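The Benjamini-Hochberg adjustment named in the footnote can be illustrated with a short sketch. This is a minimal example under stated assumptions, not the pipeline used for the table: the function name `benjamini_hochberg` and the example P values are hypothetical and chosen only for illustration.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so adjusted values stay monotonic.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical P values for four pathways (not taken from the table).
example = [0.01, 0.04, 0.03, 0.005]
print(benjamini_hochberg(example))
```

Each raw P value is scaled by the number of tests divided by its rank, and the running minimum keeps a smaller raw P value from receiving a larger adjusted value than a bigger one.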
A cholinergic basal forebrain feeding circuit
modulates appetite suppression


Alexander M. Herman, Joshua Ortiz-Guzman, Mikhail Kochukov, Isabella Herman, Kathleen B. Quast, Jay M. Patel,
Burak Tepe, Jeffrey C. Carlson, Kevin Ung, Jennifer Selev


Atypical food intake is a primary cause of obesity and other
eating and metabolic disorders. Insight into the neural control of
feeding has previously focused mainly on signalling mechanisms
associated with the hypothalamus, the major centre in the
brain that regulates body weight homeostasis. However, roles of
non-canonical central nervous system signalling mechanisms in
regulating feeding behaviour have been largely uncharacterized.
Acetylcholine has long been proposed to influence feeding
owing in part to the functional similarity between acetylcholine and 
nicotine, a known appetite suppressant. Nicotine is an exogenous 
agonist for acetylcholine receptors, suggesting that endogenous 
cholinergic signalling may play a part in normal physiological 
regulation of feeding. However, it remains unclear how cholinergic 
neurons in the brain regulate food intake. Here we report that 
cholinergic neurons of the mouse basal forebrain potently influence 
food intake and body weight. Impairment of cholinergic signalling 
increases food intake and results in severe obesity, whereas 
enhanced cholinergic signalling decreases food consumption. We 
found that cholinergic circuits modulate appetite suppression 
on downstream targets in the hypothalamus. Together our data 
reveal the cholinergic basal forebrain as a major modulatory centre 
underlying feeding behaviour. 

The diagonal band of Broca (DBB) (Extended Data Fig. 1a-g) is a
major component of the cholinergic basal forebrain. Studies have
shown that cell types associated with feeding are connected with the
DBB. Given this correlation, we sought to determine if cholinergic
neurons in the mouse DBB modulate feeding. With feeding after an
overnight fast, we observed co-localization between c-Fos and the
cholinergic-specific marker, choline acetyltransferase (ChAT) (Fig. 1a-f).
Given their activation in response to food intake, we next assessed
the necessity of cholinergic DBB neurons in feeding modulation. We
stereotaxically targeted cholinergic DBB neurons for ablation using
a Cre-dependent adeno-associated virus (AAV) that expressed an
enhanced yellow fluorescent protein (EYFP)-linked diphtheria toxin
receptor in Chat-cre+/-; R26LSL-tdTomato/+ mice. DBB cholinergic
neurons were mainly decimated after treatment with diphtheria toxin,
whereas those in surrounding areas of the brain were unaffected
(Fig. 1g-n). Within two weeks after ablation, we observed hyperphagia
(Fig. 1o) and severe obesity (Fig. 1p-q). To assess the extent that
food consumption contributed to this phenotype, we conducted paired
feeding assays on obese DBB-ablated animals versus their non-ablated
controls. Control mice exhibited normal food intake and minimal
weight gain, whereas DBB-ablated animals lost weight when available
food was limited to control levels (Fig. 1r-s), suggesting that
hyperphagia was required to maintain obesity. This effect did not appear
to be mediated by changes in energy expenditure, as overall levels of
activity (Fig. 1t) and oxygen consumption (Extended Data Fig. 2a)
were unaffected during early stages post-DBB ablation. Metabolic


Qingchun Tong & Benjamin R. Arenkiel


activity was only altered once obesity became a burden (Extended Data 
Fig. 2b-k). 

We sought to determine whether conditional removal of cholinergic 
neurotransmission selectively from the DBB, without cell death, led to 
increased food consumption and obesity. We stereotaxically delivered 
an EGFP-Cre-expressing AAV into the DBB of Chatflox/flox homozygous
mice (Fig. 2a), and evaluated ChAT knockout by immunohistochemistry
(Fig. 2b-k). Counts of ChAT-immunopositive neurons from
control and experimental brains showed a 72% and 55% decrease in 
ChAT expression in the DBB of Cre-expressing female and male mice, 
respectively, compared to controls (Fig. 2l). Consistent with a role for
cholinergic signalling in feeding behaviour, Cre-expressing animals 
displayed increased food intake (Fig. 2m) and subsequent weight gain 
(Fig. 2n). Because knockout is restricted to the Chat gene, these data
imply that the observed effects on feeding and body weight were mediated
by cholinergic signalling, although they do not rule out a role for
GABAergic neurotransmission from cholinergic DBB neurons, which
have recently been reported to co-release GABA (Extended Data
Fig. 3a-c). 

To test how acute inactivation of these neurons affected food intake, 
Chat-cre+/- mice were DBB-targeted for conditional expression of
hM4D-EGFP (Extended Data Fig. 4a). Following CNO treatment,
hM4D-EGFP-expressing mice consumed more food compared to controls
(Extended Data Fig. 4b), suggesting that both acute and chronic
inactivation of cholinergic DBB neurons were sufficient to increase 
food intake. 

We sought to determine if activating DBB cholinergic neurons was 
sufficient to suppress food intake in vivo. To test this, we expressed 
channelrhodopsin-2 (ChR2) in DBB cholinergic neurons, followed by 
monitoring feeding behaviour during photostimulation (Fig. 3a-g). 
We found that prolonged (48h) activation of cholinergic DBB neurons 
significantly decreased daily food intake under conditions of normal 
food access (Fig. 3h, i), whereas mock-stimulated, non-ChR2-expressing 
controls showed no significant changes in feeding behaviour (average 
daily food intake was 4.26 g (0.22, s.e.m.); 112.48% of baseline 
(45.91%, s.e.m.), n = 3). To test the acute effect of DBB stimulation
on feeding, we fasted ChR2-expressing mice overnight and presented 
them with chow in the morning. We observed that acute cell body 
stimulation resulted in a similar overall decrease (approximately 25%) 
in food intake (Fig. 3j-k).

We selectively targeted cholinergic DBB neurons for conditional 
viral expression of a presynaptically localized synaptophysin-EGFP
(Extended Data Fig. 5a-e), allowing the identification of their
presumptive downstream synaptic targets. Using this method, we
observed DBB neuron terminals in previously reported areas of
the brain (ref. 11; Extended Data Fig. 5f-n), as well as in ventral-medial
domains of the hypothalamus (Extended Data Fig. 5o-u), which 
have previously been shown to express markers for cholinergic 


Program in Developmental Biology, Baylor College of Medicine, Houston, Texas 77030, USA. Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA. Medical Scientist Training Program, Baylor College of Medicine, Houston, Texas 77030, USA. Department of Neuroscience, Baylor College of Medicine, Houston, Texas 77030, USA. Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston, Texas 77030, USA. … of Molecular Medicine, University of Te…, Houston, Texas 77030, USA.

© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved.

12 OCTOBER 2016 | VOL 538 | NATURE | 253


LETTER 




Figure 1 | DBB-specific cholinergic cell death results in hyperphagia
and obesity. a-f, c-Fos and ChAT immunohistochemistry in the HDB.
g-n, DBB of saline-treated (g-j) or diphtheria toxin-treated (k-n) animals.
Scale bars, 100 µm. o, Food intake between DBB-ablated (n = 15 mice)
and non-ablated animals (n = 13 mice). Time points represented as
mean ± s.e.m., P = 0.0002, F (1, 26) = 18.06 by two-way ANOVA with
repeated measures. p, Body weight between DBB-ablated (n = 15) and
non-ablated animals (n = 13). Time points represented as mean ± s.e.m.,


terminals of unknown origin. Within this region resides the arcuate
nucleus of the hypothalamus, a site that contains well-characterized
cell populations that conversely regulate hunger and satiety through
NPY/AgRP-expressing and POMC-expressing neurons, respectively.
As cholinergic basal forebrain neurons co-express GABA, it is possible
that loss of cholinergic neurons from the DBB could disinhibit AgRP
neurons, thereby promoting food intake through elevated expression
levels of Agrp. To test this, Agrp transcripts were assessed in
DBB-ablated animals that were pair-fed to respective controls shortly after
ablation, before hyperphagia-induced obesity. Under these conditions,
Agrp transcript analysis from the arcuate nucleus of DBB-ablated animals
showed no significant decrease in levels of Agrp compared to controls
(Extended Data Fig. 6a). Interestingly, however, Pomc transcript




P < 0.0001, F (1, 26) = 143.1 by two-way ANOVA with repeated measures.
q, Representative DBB-ablated and non-ablated mice. r, s, Average daily
food intake (r) and body mass (normalized with respect to the day-0
value; s) during paired feeding (n = 5 mice per group); pink shading
represents restrictive period. t, Total time active per day before
and during early and late stages of obesity. Data represented as
mean ± s.e.m., *P < 0.05 by two-sided, unpaired Student's t-test.


levels were strikingly reduced in DBB-ablated mice (Extended Data
Fig. 6a), suggesting that cholinergic DBB neurons may normally act
to modulate downstream satiety pathways. Previous reports show that
POMC neurons are activated by cholinergic nicotinic agonists; however,
their effect on AgRP/NPY neurons appears much more variable.
From our own recordings (Extended Data Fig. 6b), acetylcholine did
not show fast excitation on NPY neurons, perhaps due to muscarinic
acetylcholine receptor (mAChR) activity or through broader network
effects.

Recent studies have implicated the arcuate nucleus in appetite
suppression via AChR signalling onto POMC neurons, and conditional
rabies virus tracing experiments suggested that arcuate POMC neurons
receive significant input from neurons in the DBB. Whole-cell


Figure 2 | Chat conditional knockout from the DBB leads to obesity and
hyperphagia. a, Strategy for Chat conditional knockout. b-k, ChAT
immunostains of DBB in EGFP-expressing (b-f), or EGFP-Cre-expressing (g-k)
Chatflox/flox brains. Scale bars represent mm (c, h) or 100 µm (f, k).
l, Percentage of DBB neurons expressing ChAT (n = 3 per gender).
Data represented as mean ± s.e.m., *P < 0.05, **P < 0.01 by two-sided,
unpaired Student's t-test. m, n, Daily average food intake (m) and
average body mass (n) (12 weeks after the conditional knockout,
n = 7 females, 5 males for EGFP-Cre or EGFP mice, n = 6 females, 6 males
for wild type). Data represented as mean ± s.e.m., *P < 0.05,
**P < 0.01 by two-sided, unpaired Student's t-test. Schematic in a was
adapted from an image from the Allen Brain Institute Reference Atlas.
Figure 3 | In vivo ChR2-mediated DBB stimulation decreases food intake.
a, ChR2::EYFP in cell bodies of the DBB. Scale bar, 1 mm. Membrane-bound
ChR2::EYFP is not restricted to cell bodies and shows neuronal
projections. b-d, ChR2::EYFP in DBB neurons after stimulation and
stained to identify c-Fos. e, f, Strategy for ChR2-assisted feeding assay.
g, Representative recordings from a ChR2-expressing cholinergic DBB
neuron (10 ms pulses, 20 Hz, 5 s trains, 30 s intervals). h, i, Raw (h)
and normalized (with respect to the mean pre-stimulation value; i) food
intake values before, during, and post-stimulation (n = 7 mice).
j, k, Raw (j) and normalized (with respect to the mean no-stimulation
value; k) food intake values in the absence or presence of stimulation
(n = 5 mice). Data represented as mean ± s.e.m., *P < 0.05 by two-sided,
paired Student's t-test. Schematic in e was adapted from an image from
the Allen Brain Institute Reference Atlas.


recordings from labelled POMC neurons in Pomc-EGFP+/- mice
showed that neuronal firing increased in the presence of acetylcholine
(Extended Data Fig. 6c-d), and this response was blocked by AChR
blockers (Extended Data Fig. 6e). To assess potential connectivity
between the DBB and arcuate nucleus, we implemented retrogradely
transported canine adenovirus (CAV-2). Using conditional
R26LSL-tdTomato/+ mice, we delivered CAV-Cre into the arcuate nucleus,
and found that cholinergic DBB neurons provide input (Fig. 4a-e).
Non-cholinergic cell types in the DBB were also labelled, suggest- 
ing that both cholinergic and non-cholinergic cell types from the 
DBB provide input into the hypothalamus to regulate its functions. 
To investigate this connectivity in vivo, Chat-cre+/- mice were targeted
for conditional ChR2::EYFP expression in DBB cholinergic
neurons, and a fibre optic was implanted into the ventral space
of the third ventricle to stimulate DBB terminals within the ventral
hypothalamus (Fig. 4f). ChR2-expressing and non-ChR2-expressing
(mock-stimulated) implanted animals were fasted overnight
and presented with chow for two hours. Mock-stimulated animals
showed no significant changes in feeding behaviour during
photostimulation (average was 1.381 g (±0.1429 g, s.e.m.); 117.1% of
baseline (12.12%, s.e.m.), n = 6 animals), whereas stimulated
ChR2-expressing animals exhibited diminished food intake com- 
pared to non-stimulated conditions (Fig. 4g-h), consistent with 
decreases observed from DBB cell body stimulation, and support- 
ing anatomical tracing data. Although these data support a role for 
cholinergic modulation of the arcuate nucleus, given sparse labelling of 
terminals throughout the hypothalamus, as well as dense innervation 
of the median eminence (Extended Data Fig. 7a-h), these results do 
not rule out an influence of other hypothalamic sites and cell types. 
Lastly, to test if this effect was AChR-mediated, we systemically injected 


Figure 4 | Stimulation of cholinergic DBB terminals in the hypothalamus
suppresses food intake. a, Strategy for CAV-Cre-mediated circuit mapping.
b, tdTomato expression in the hypothalamus. c, ChAT immunohistochemistry.
d, e, tdTomato expression (d) and merged (e) images in the DBB.
Scale bars, 200 µm (top row). Respective zoomed images from the dotted
boxes in c-e are shown in the bottom row. Scale bar, 100 µm. White arrows
point to neurons showing ChAT and tdTomato co-localization.
f, Strategy for ChR2-assisted in vivo behaviour. g, h, Total (g) and
normalized (with respect to the mean no-stimulation value; h) food intake
(n = 8 mice) after two-hour re-feeding after overnight fast. Data
represented as mean ± s.e.m., *P < 0.05 by two-sided, paired Student's
t-test. Schematics in a and f were adapted from images from the Allen
Brain Institute Reference Atlas.
photostimulated animals with the nicotinic cholinergic receptor
(nAChR) antagonist mecamylamine, which has previously been shown
to block nAChR-mediated decreases in food intake. In the presence of
mecamylamine, photostimulated animals showed a blunted response
compared to stimulation alone (Extended Data Fig. 8a-b), suggesting
that decreased food intake after stimulation was mediated in part by
nAChR signalling. Notably, however, we did not observe a complete
phenotypic blockade, suggesting that mAChR signalling may play a
synergistic role in modulating food intake, and may therefore explain why
nAChR antagonism alone was not sufficient to fully suppress feeding.

Together, our data demonstrate a powerful role for cholinergic basal
forebrain neurons in modulating food intake. Cholinergic systems are
highly druggable, which may offer a new avenue for targeting
cholinergic mechanisms to help treat eating disorders. AChR transcript
analysis from the arcuate nucleus showed expression of common AChR
subunits (Extended Data Fig. 9a), serving as possible targets for
pharmacological interventions. Understanding the AChR profiles of
prominent feeding-associated cell types will surely be useful in this respect.
Additionally, revealing the contribution of other signalling
mechanisms from cholinergic neurons will be essential in understanding
their composite roles in feeding. Cholinergic and other neuromodulatory
systems are known to co-express multiple neurotransmitters and/or
neuropeptides. Although cholinergic signalling is sure to play
an important role, it remains unclear how dual transmitter systems
function in concert to modulate their targets. Although our studies
indicate a modulatory effect of acetylcholine on POMC neurons to
influence food intake, it is possible that other cell types of the
hypothalamus and/or other brain regions may be affected in a manner yet
to be determined by cholinergic innervation. As such, it is possible that our
observed phenotypes are not due solely to downstream effects in the
arcuate nucleus, but may be a combinatorial effect of altered
cholinergic signalling at other projection sites or cell types of the
hypothalamus or other innervated brain regions. It will be especially interesting
to determine the role of alternative cholinergic centres in modulating
feeding behaviour such as those associated with the reward and
addiction pathways of the mesolimbic system. The boundary between
homeostatic and non-homeostatic (hedonic/aversive) control of feeding
is not always distinct or clear-cut, and the two often converge. Given
its role as a powerful modulator of reward and aversion, and
its involvement in addiction, acetylcholine is a prime candidate that
may link known canonical homeostatic mechanisms that govern feeding
behaviour, with the reward and/or aversive aspects of food intake.
Nonetheless, fully understanding the relationship between cholinergic
systems and their role in modulating feeding behaviour is an important
step towards uncovering a comprehensive profile of feeding regulation.


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.
METHODS

Data reporting. No statistical methods were used to predetermine sample size.
When relevant (such as experiments requiring multiple trials) randomization was
carried out. Behavioural trial experiments were randomized. There was blinding
of initial allocation of animals into groups, but not thereafter. Cell counts were 
blinded. 

Experimental mouse lines. All animals used in this study were treated in compli- 
ance with US Department of Health and Human Services and Baylor College of 
Medicine IACUC guidelines. For the studies reported here, both male and female 
mice were considered for analyses. Standard pellet mouse chow (Harlan, 2920X) 
was used for all experiments, and all animals were maintained on a normal 12-h
light-dark cycle. Chat-cre (B6;129S6-Chattm…(cre)Lowl/J), Pomc-EGFP
(C57BL/6J-Tg(Pomc-EGFP)1Low/J), Npy-hrGFP (B6.FVB-Tg(Npy-hrGFP)1Lowl/J),
Chatflox/flox (B6.129-Chattm1Jrs/J), R26LSL-tdTomato
(B6.Cg-Gt(ROSA)26Sortm…(CAG-tdTomato)Hze/J), and Vgat-cre (Slc32a1tm2(cre)Lowl/J) mice were originally
purchased and are available from Jackson Laboratories. Chat-cre+/-; R26LSL-tdTomato/+
mice were generated by crossing heterozygous male Chat-cre+/- mice with female
homozygous R26LSL-tdTomato mice. Chatflox/flox animals were bred and maintained as
homozygotes. A Vgat-cre homozygous male was crossed to female C57BL/6J mice
to generate heterozygous Vgat-cre+/- animals. Chat-cre+/-, Pomc-EGFP+/- and
Npy-hrGFP+/- mice used in this study were maintained as heterozygotes and bred
to wild-type C57BL/6J female mice. Genotyping for Chatflox, Pomc-EGFP, and
Npy-hrGFP animals was done according to available Jackson Laboratory protocols
for these strains. Genotyping for Cre was done using primers for Cre recombinase
detection (forward primer: …CATTTCTGGGGATTGCTTA-3′, reverse primer:
5′-GTCATCCTTAGCGCCGTAAA-3′).

Microscopy and immunohistochemistry. Animals were deeply anaesthetized
using isoflurane and were transcardially perfused with PBS followed by 10%
neutral buffered formalin (NBF, Azer Scientific). Brains were dissected and post-fixed
in 10% NBF overnight at 4°C. Brains were cryoprotected in a 20% sucrose/PBS
solution at 4°C for one day, followed by a 30% sucrose/PBS solution at 4°C for one
more day. Brains were then embedded and frozen in OCT and stored at −80°C.
Brains visualized using endogenous or virally expressed fluorescent reporters
were cut using a cryostat (Leica CM1860) in coronal sections at 25-30 µm. For
ChAT and β-endorphin immunohistochemistry, 40 µm free-floating sections were
blocked for 1h at room temperature in 10% horse serum blocking solution, made in
PBS-TC (1× PBS, 0.5% Triton-X 100, 0.1 mM CaCl2, pH 7.35). Sections were then
incubated overnight at 4°C at a 1:200 dilution of block solution containing goat
anti-ChAT primary antibody (Millipore, AB144P) or rabbit anti-β-endorphin
primary antibody (Phoenix Pharmaceuticals, H-022-33). Sections were then washed
4 times, 30min each in plain PBS-TC. Sections were then incubated in secondary
antibody (donkey anti-goat Alexafluor-488 or Alexafluor-555, Life Technologies)
at a 1:200 dilution for 3h at room temperature. Sections were then washed 4 times
for 30min each in PBS-TC. All sections were mounted using DAPI Fluoromount-G
(Southern Biotech, 0100-20). Detection of fluorescent expression was performed
using a Leica TCS SPE confocal microscope under a 10× or 20× objective.
Stereotaxic injections and viral constructs. For all stereotaxic injections, mice
were anaesthetized using a ketamine/dormitor mixture and were maintained under
anaesthesia using vaporized isoflurane with O2. All injections were performed
using a stereotaxic apparatus synced to Angle Two software for coordinate
guidance. For DTR-mediated cell death of cholinergic neurons, female Chat-cre+/-
or Chat-cre+/-; R26LSL-tdTomato/+ mice (8-10 weeks old) were bilaterally injected
into the horizontal limb of the diagonal band of Broca (HDB, right hemisphere,
from bregma: AP = +0.14, DV = −5.80, ML = −1.29; left hemisphere, from
bregma: AP = +0.14, DV = −5.74, ML = +1.17) with 500 nl per hemisphere of
a Cre-dependent AAV-EF1α-FLEX-DTR-P2A-EYFP-WPRE-hGHpA, serotype
DJ/8. For conditional Chat knockout experiments, Chatflox/flox animals (8-10
weeks old) were bilaterally injected into the HDB with 300 nl per hemisphere of
AAV-EF1α-EGFP-P2A-CRE-WPRE-hGHpA for experimental animals, or
AAV-EF1α-EGFP-WPRE-hGHpA for control animals (serotype DJ/8 for both AAVs).
For synaptophysin tracing experiments using the EGFP variant, Chat-cre+/- mice
(8-16 weeks old) were injected bilaterally into the HDB with 500 nl per
hemisphere of a Cre-dependent AAV-EF1α-FLEX-Syn::EGFP-WPRE-hGHpA, serotype
DJ/8. For synaptophysin tracing experiments using the mRuby2 variant, Chat-cre+/-;
Pomc-EGFP+/- mice (12 weeks old) were injected bilaterally into the HDB with
500 nl per hemisphere of a Cre-dependent AAV-EF1α-FLEX-Syn::mRuby2-WPRE-hGHpA,
serotype DJ/8. Lastly, for in vivo ChR2 behaviour experiments, male
Chat-cre+/- mice (12-14 weeks old) were bilaterally injected into the HDB with
500 nl per hemisphere of a Cre-dependent AAV-EF1α-DIO-hChR2(H134R)-EYFP-
WPRE-hGHpA (Addgene, plasmid number 20298) serotype 2/9.

Neuronal activation after feeding and c-Fos IHC. Male wild-type animals were
fasted overnight before being presented with standard pellet mouse chow (Harlan,
2920X) for 3h the following morning (fed group), while a second group of mice was
fasted overnight but not presented with chow (fasted group). Mice were then
immediately euthanized and perfused with PBS and 10% NBF. Brains were fixed overnight
in 10% NBF before two overnight fixations in 20% and 30% sucrose/PBS
solutions. Brains were frozen in OCT cutting compound and cryosectioned at 35 µm.
Sections were then blocked for 1h at room temperature in 10% horse serum
blocking solution, made in PBS-TC (1× PBS, 0.5% Triton-X 100, 0.1 mM CaCl2,
pH 7.35). Sections were then incubated overnight at 4°C at a 1:200 dilution of
block solution containing goat anti-ChAT primary antibody (Millipore, AB144P)
and 1:500 dilution of rabbit anti-c-Fos antibody (Calbiochem, PC38). Sections
were then washed 4 times for 30min each in plain PBS-TC. Sections were then
incubated in secondary antibodies (donkey anti-goat Alexafluor-488, and donkey
anti-rabbit Alexafluor-546) at a 1:200 dilution each for 3h at room temperature.
Sections were then washed 4 times for 30min each in PBS-T. All sections were
mounted using DAPI Fluoromount-G (Southern Biotech, 0100-20). Detection of
fluorescent expression was performed using a Leica TCS SPE confocal microscope
under a 20× objective.

DTR-mediated cell death and Chat conditional knockout assays. After allowing
10-14 days for conditional viral expression (injections and viral construct,
described previously), mice were intraperitoneally (i.p.) injected 3 times daily
for 5 days with 800 ng (4 ng µl−1 working solution) of diphtheria toxin (Sigma,
D0564) for optimal cell death of targeted cholinergic neurons. Female Chat-cre+/-;
R26LSL-tdTomato/+ mice were used initially to validate DTR-mediated cell death
by visualizing DBB cholinergic cell loss. For remaining experiments, female
Chat-cre+/- animals were used. For controls, age- and gender-matched Chat-cre+/-
mice (stereotaxically injected identically into the DBB with AAV-FLEX-DTR-P2A-
EYFP) were injected with equal volume (200 µl per injection) of sterile saline. Body
weights and daily food intake were measured and averaged per group for each
time point presented. For Chat conditional knockout assays, Chatflox/flox mice
were injected bilaterally into the HDB as previously described with either
AAV-EF1α-EGFP-P2A-CRE-WPRE-hGHpA for conditional knockout animals or
AAV-EF1α-EGFP-WPRE-hGHpA for controls. Body weights and daily food intake were
measured and averaged per group for each time point presented. For cell counts
after Chat conditional knockout, 3 mice from each group were euthanized and
brains were sectioned at 40 µm for ChAT immunohistochemistry. 10 sections
representing the anterior, central, and posterior areas of the DBB were chosen,
and blinded, total cell counts based on ChAT immunoreactivity were tallied.
Counts from all 10 sections from a single mouse brain were totalled and averaged
for each group of 3 mice. Data were normalized to control levels of expression and
represented as a mean percentage.
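The cell-count normalization described above can be sketched in a few lines. This is an illustrative reading of the procedure, not the authors' analysis code: the function name `percent_of_control` and all counts below are hypothetical.

```python
from statistics import mean


def percent_of_control(control_counts, experimental_counts):
    """Express group means as a percentage of the control group mean.

    Each argument is a list of mice; each mouse is a list of per-section
    ChAT-positive cell counts (10 sections per mouse in the text above).
    """
    # Total the sections for each mouse.
    control_totals = [sum(sections) for sections in control_counts]
    experimental_totals = [sum(sections) for sections in experimental_counts]
    # Normalize every mouse to the mean control total.
    control_mean = mean(control_totals)
    control_pct = [100.0 * t / control_mean for t in control_totals]
    experimental_pct = [100.0 * t / control_mean for t in experimental_totals]
    return mean(control_pct), mean(experimental_pct)


# Hypothetical counts: 3 mice per group, 10 sections per mouse.
control = [[20] * 10, [22] * 10, [18] * 10]
knockout = [[5] * 10, [6] * 10, [7] * 10]
ctrl_pct, ko_pct = percent_of_control(control, knockout)
print(ctrl_pct, ko_pct)
```

With these made-up numbers the knockout group retains 30% of control ChAT-positive cells, which would be reported as a 70% decrease.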
Activity monitoring and metabolic assays. Activity (reported as time active per
day), O2 consumption, and metabolic blood assays were performed by the Baylor
College of Medicine Mouse Metabolism Core before obesity phenotypes (3 days
after diphtheria toxin (DT) treatment), during early stages of obesity and
hyperphagia (3 weeks post-DT treatment), and at late stages of obesity and hyperphagia
(3 months post-DT treatment). Activity and O2 consumption assays were
performed using the Oxymax Comprehensive Laboratory Animal Monitoring System
(CLAMS, Columbus Instruments). Lean mass and body fat content was assessed
using quantitative MRI. Blood panel assays for cholesterol, leptin, insulin, and
glucose were also performed by the Baylor College of Medicine Mouse Metabolism
Core. Blood was collected via the tail vein. Mice were fasted for 4h before
measuring blood glucose.

Paired feeding assays. Paired feeding assays were performed with individually
housed, male DT-treated (DBB-ablated) and saline-treated (non-ablated) animals.
Assays for determining the contribution of food intake on maintaining obesity
were conducted on animals 12 weeks post-ablation. First, daily body weight
and ad libitum food intake were recorded for 7 days to establish
baselines for all animals. Then 1 control mouse and 1 experimental mouse were
randomly paired. All food from experimental cages was removed and only
an equivalent amount of food consumed the previous day by a mouse's respective
control partner was introduced to the cage. This restrictive period lasted
21 days. Afterwards, all experimental mice were allowed to resume feeding
ad libitum and food intake and body weight were measured daily for
2 weeks. Change in body weight over time was normalized as a percentage of day 1
initial starting weight for each individual animal. For paired feeding conducted on
animals used for Agrp and Pomc transcript analysis, assays were performed 3 days
post-ablation to prevent significant weight gain from hyperphagia. A restrictive
feeding period was conducted for 21 days, after which mice were euthanized on
the morning after the final day, and hypothalamic tissue was harvested for RNA
purification and subsequent RT-PCR (see below).
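The bookkeeping behind the paired feeding assay can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function names `pair_fed_allotments` and `percent_of_initial` and all numbers are hypothetical.

```python
def pair_fed_allotments(control_intake_g):
    """Food offered to the experimental mouse on each restricted day.

    The allotment on a given day equals what the control partner ate on
    the previous day, so the first entry of control_intake_g is assumed
    to be the last baseline day and the final entry is never offered.
    """
    return list(control_intake_g[:-1])


def percent_of_initial(weights_g):
    """Normalize a body-weight series to its first (day 1) value."""
    initial = weights_g[0]
    return [100.0 * w / initial for w in weights_g]


# Hypothetical control intake (g per day) and experimental body weights (g).
print(pair_fed_allotments([4.0, 4.2, 3.9]))
print(percent_of_initial([50.0, 45.0, 40.0]))
```

Normalizing each animal to its own starting weight lets obese and lean animals be compared on the same percentage scale.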

In vivo optogenetic behaviour assays. Concurrent with AAV-EF1a-DIO-hChR2(H134R)-EYFP-WPRE-hGHpA injections (as described previously), male Chat-cre+/− mice were bilaterally implanted with 200 µm silica fibre optic implants made in-house (Thor Labs, TS1249968; 230 µm ferrules, Precision Fibre Products, MM-FER2007C-2300) and situated 0.1 mm above the viral injection site. Fibre optic implants were held in place by a cap made from adhesive cement (C&B Metabond Quick! Cement System (Parkell)) for the initial base, and crosslinked flash acrylic (Yates-Motloid, 44115 and 44119) for the headcap. Mice were allowed at least two weeks for recovery and expression of the virus before assays were performed. For prolonged 2-day stimulation, each mouse was allowed 48 h to acclimate in a behaviour box with free access to food and water (days 1 and 2: acclimation). In addition, acclimation occurred while tethered to a dual fibre optic cord (Doric Lenses) attached to a 473 nm laser source (CrystaLaser CL-2005). After the 48-h acclimation period, a pre-measured amount of food was placed into the chamber and weighed once every 24 h for two days without stimulation (days 3 and 4: pre-stimulation). Over the next 48 h, food was weighed once each day while mice were chronically stimulated with trains of blue light (5 mW, 10 ms pulses, 20 Hz, 5 s trains, 30 s intervals) (days 5 and 6: stimulation). Finally, food was weighed once every 24 h for two final days with no blue light stimulation (days 7 and 8: post-stimulation). As a control group, non-ChR2-expressing mice were injected 
and implanted in the identical way used for experimental mice. Control mice were acclimated identically and were subsequently subjected to a mock stimulation for 48 h, and food intake was measured each day. For comparisons between pre-stimulation, stimulation, and post-stimulation conditions, paired Student's t-tests were used. For comparisons between experimental conditions and the control (mock-stimulation) condition, unpaired Student's t-tests were used. For short-term 2-h stimulation experiments, mice were given 48 h to acclimate in their behaviour chamber. After acclimation, mice were fasted overnight and subsequently presented with a pre-measured amount of food in the morning. For control conditions, mice were not stimulated and food intake was recorded every 30 min for 2 h total. For experimental conditions, mice were stimulated with trains of blue light (5 mW, 10 ms pulses, 20 Hz, 5 s trains, 30 s intervals) for 15 min before presentation of pre-measured chow, and food intake was recorded every 30 min for 2 h total in the presence of continued blue light illumination. Trials were randomized and conducted one week apart on the same animals. For experiments targeted at terminal stimulation in the arcuate nucleus of the hypothalamus, male animals were bilaterally injected into the HDB with AAV-EF1a-DIO-hChR2(H134R)-EYFP-WPRE-hGHpA, and a single fibre optic was implanted into the third ventricle at the level of the arcuate (from bregma: AP = −1.70, DV = −5.75, ML = 0.00). For behavioural assays, animals were first allowed 48 h to acclimate to their behaviour cage while tethered to a fibre optic cord. After acclimation, mice were fasted overnight. In the morning, mice were presented with standard pre-measured pellet chow and food intake was recorded every 30 min for 2 h total either under conditions of light stimulation (5 mW, 10 ms pulses, 20 Hz, 5 s trains, 30 s intervals) or no stimulation. Trials were randomized and conducted one week apart on the same animals. Paired statistics were used to compare 'stim' and 'no-stim' conditions on these animals. 
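
The stimulation parameters quoted above (10 ms pulses at 20 Hz in 5 s trains, delivered at 30 s intervals) can be turned into a pulse schedule as in the sketch below. It assumes the 30 s interval is measured from train onset to train onset, which the text does not specify; the function names are hypothetical, and the 12-train example follows the slice protocol described in the electrophysiology section.

```python
def pulse_onsets(n_trains, train_s=5.0, rate_hz=20.0, interval_s=30.0):
    """Return the onset time (s) of every light pulse.

    Assumption: interval_s is the spacing between successive train ONSETS.
    """
    pulses_per_train = round(train_s * rate_hz)  # 5 s x 20 Hz = 100 pulses
    period_s = 1.0 / rate_hz                     # 50 ms between pulse onsets
    return [
        train * interval_s + pulse * period_s
        for train in range(n_trains)
        for pulse in range(pulses_per_train)
    ]


def duty_cycle(pulse_s=0.010, train_s=5.0, rate_hz=20.0, interval_s=30.0):
    """Fraction of total time the light is on, under the same assumption."""
    light_per_train_s = pulse_s * round(train_s * rate_hz)
    return light_per_train_s / interval_s


onsets = pulse_onsets(n_trains=12)  # 12 trains, as in the slice recordings
fraction_on = duty_cycle()
```

Under this reading each train delivers 1 s of light in total, so the light is on for roughly one thirtieth of the session.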

As a control group, Cre-negative male littermates were injected with virus and implanted in the same way as experimental mice. Behavioural assays on these mice were done the same way under conditions of light illumination. Comparisons between mock-stimulated and experimental cohorts were done using unpaired statistics. For terminal stimulation experiments in the presence of mecamylamine (Tocris, catalogue number 2843), mice were fasted overnight and presented with a pre-measured amount of chow in the morning. Mecamylamine was administered by i.p. injection at 1 mg per kg 15 min before the start of a 2-h feeding in the presence or absence of blue light illumination. Control (sterile 1x PBS i.p. injections) and experimental trials were conducted 1 week apart. 

Electrophysiology and pharmacology of POMC-EGFP or NPY-hrGFP neurons, ChR2-expressing DBB neurons, and hM4D-expressing DBB neurons. Brain slices containing the hypothalamic arcuate nucleus were prepared from 6-8-week-old Pomc-EGFP+/− or Npy-hrGFP+/− transgenic mice of either gender. For ChR2 stimulation and hM4D-mediated inhibition, brain slices containing the DBB were prepared from 12-16-week-old male animals expressing either ChR2::EYFP or hM4D-EGFP in the DBB, respectively. Animals were anaesthetized with isoflurane and brains were rapidly removed and transferred into sucrose-based cutting solution, containing (in mM): 250 sucrose, 25 NaHCO3, 1.25 NaH2PO4, 2.5 KCl, 1.5 MgCl2, 2 CaCl2, 10 glucose, and continuously bubbled with 5% CO2 / 95% O2. Coronal brain slices (300 µm thick) were prepared using a Leica VT 1200 vibratome and placed for recovery in a 5% CO2 / 95% O2 bubbled regular ACSF solution, containing (in mM): 128 NaCl, 24 NaHCO3, 1 NaH2PO4, 3 KCl, 1 MgCl2, 1.6 CaCl2, 8 glucose. After at least 1-h recovery and 20-30 min before recording, slices were transferred into a recording chamber continuously perfused at 2 ml min−1 with the aforementioned ACSF at 24 °C. POMC-, NPY-, ChR2::EYFP-, and hM4D-expressing neurons were identified by transmitted light DIC and EGFP fluorescent imaging using a Slicescope Pro 6000 optical setup (Scientifica), equipped with a CoolLED pE-100 470 nm excitation light source, 49002-ET-EGFP (FITC/Cy2) emission filter (Chroma Technology), and optiMOS camera (QImaging). Electrical activity of neurons was recorded in whole-cell current clamp mode using a Multiclamp 700b amplifier and a 1440a Digidata interface (Molecular Devices). Pipette solution contained (in mM): 10 KCl, 120 K gluconate, 1 MgCl2, 10 HEPES, 1 EGTA, 5 Na2-ATP, 0.01 Na-GTP, pH 7.2. To test the cholinergic effects on POMC and NPY neurons, baseline neuronal activity was prerecorded for at least 10 min to assure a stable firing rate, after which acetylcholine (Sigma-Aldrich, A6625) was added to the bath perfusion at 100 µM. For ChR2 stimulation, 12 consecutive trains of blue light were given at 30 s intervals. Each train lasted for 5 s with 10 ms light pulses delivered at 20 Hz. For recordings from hM4D-expressing cells, we recorded 90 s baseline sweeps and delivered 5 s current injections at 2 pA and 10 pA, spaced 25 s apart. This protocol was repeated 6 min after CNO bath application. For all recordings, neuronal firing activity was analysed offline using the event detection feature of Clampfit 10.3 software (Molecular Devices). Repeated measures ANOVA with Holm-Sidak multiple comparison, and Sigma Plot 11.0 software (Systat Software) were used for statistical analyses of data, where applicable. For acute acetylcholine responses, a localized 2 s application of acetylcholine (100 mM, Sigma-Aldrich, A6625) was applied near a patched POMC-EGFP neuron (held at −70 mV) using a FemtoJet (Eppendorf). Each recording was performed using a 20-s sweep with an inter-trial interval of 1 min and repeated for 5 sweeps each for baseline, synaptic blockers (10 µM CNQX (Tocris), 20 µM APV (Tocris), 50 µM GABAzine (Tocris)), and nicotinic blockers (10 µM mecamylamine (Tocris), 0.1 µM methyllycaconitine citrate (Tocris), and 10 µM dihydro-β-erythroidine hydrobromide (Tocris)). 

CAV-Cre tracing experiments. R26!!! animals were stereotaxically injected bilaterally with 70 nl of CAV-Cre virus (purchased from the vector core at the Institut de Génétique Moléculaire de Montpellier) targeted to the arcuate nucleus (from bregma: AP = −1.70, DV = −5.80, ML = ±0.20). Sections through the DBB were obtained and stained for ChAT using the identical ChAT IHC protocol detailed previously. All sections were mounted using DAPI Fluoromount-G (Southern Biotech, 0100-20). Detection of fluorescent expression was performed using a Leica TCS SPE confocal microscope under a 10× or 20× objective. 

hM4D-mediated feeding assays. 12-week-old, male Chat-cre+/− mice were stereotaxically injected bilaterally into the DBB (500 nl per hemisphere) with an hM4D-EGFP-expressing AAV (AAV-CBA-FLEX-hM4Di-P2A-EGFP-WPRE-sv40pA, serotype 2/9, Addgene, plasmid number 52536). After a two-week recovery, mice were fasted overnight in their home cage, and food was presented in the morning. Food intake was measured every 30 min for 2 h total, in the presence or absence of CNO. For control experiments, mice were injected i.p. with sterile saline and allowed to wait 15 min before food presentation. For experimental conditions, mice were injected i.p. with CNO (5 mg per kg) and allowed to wait 15 min before food presentation and measurement, which was recorded for 2 h in total. Trials were randomized and conducted one week apart on the same animals. Paired statistics were used to compare 'saline' and 'CNO' conditions on these animals. 

Transcript analyses. For Agrp and Pomc transcript analysis from the arcuate nucleus, 4 DBB-ablated mice and 4 non-ablated mice were euthanized in the morning and their brains were immediately dissected. For AChR transcript analysis, 4 wild-type male mice were taken and brains dissected and processed identically. Small sections of the ventral hypothalamus containing the arcuate nucleus were dissected out and RNA was isolated following the TRIzol (Life Technologies, 15596-018) protocol for tissue homogenization and RNA isolation. In brief, tissue was placed in 1 ml of TRIzol reagent and homogenized using a 1.5 ml pre-sterilized pestle. Homogenized samples were allowed to incubate at room temperature for 5 min, 0.2 ml of chloroform was added, and the tube was shaken vigorously by hand for 15 s. The sample was again incubated at room temperature for 2 min and centrifuged at 12,000g for 15 min at 4 °C. The upper aqueous phase was pipetted into a new tube, 0.5 ml of isopropanol was added, and the mixture was incubated at room temperature for 10 min. The tube was centrifuged at 12,000g for 10 min at 4 °C. The supernatant was removed and the RNA pellet was washed with 1 ml of 75% ethanol. The sample was vortexed and centrifuged at 7,500g for 5 min at room temperature. The pellet was air-dried for 15 min and resuspended in 40 µl RNase-free water at 60 °C. RNA was DNase-digested using the manufacturer's protocol (Promega). DNase was inactivated via phenol-chloroform extraction. Purified RNA was quantified using a NanoDrop (Wilmington, DE), and first-strand cDNA was synthesized using the High Capacity cDNA Reverse Transcription Kit (Applied Biosystems). Negative controls did not contain reverse transcriptase. Transcripts were amplified using standard PCR conditions (95 °C for 120 s; 34 cycles of 95 °C for 20 s, 60 °C for 20 s and 72 °C for 20 s; 72 °C for 300 s; 4 °C until storage at −20 °C). Amplified products were run on 2% agarose gels, imaged, and quantified using ImageJ software. AChR primer sequences were as follows (forward primer first, reverse primer second, in 5'-3' orientation): 
CHRNA1: GTCCAATAACGCCGCTGAGG, CTAGCGATGGCTATGGCTGG; 
CHRNA2: GACTCTTCGGTGAAGGAAGATTG, AGAGCAGAAGATGGTTGTCCAG; 
CHRNA3: GCCAAAGAGATTCAAGATGATTG, TCTGGGGCTATTGAGAAAGTGC; 
CHRNA4: GACTTCTCGGTGAAGGAGGAC, GGAAGATGTGGGTGACTGACG; 
CHRNA5: CGTCCGCGAGGTTGTTGAAG, AGCTGCTTGACTGCTCACTAAG; 
CHRNA6: CAAACGAGGTATAAGACGACTG, TCTTGTGGGGCTAGCTCGG; 
CHRNA7: CCTAAGTGGACCAGGATCATTC, ATGTAGAGCAGGTTGCCATTGC; 
CHRNA9: GTCCCTCTGATAGGAAAATACTAC, CTAAGGCAGCTCTCACCCAC; 
CHRNA10: ACTCATCGGAAAGTACTATATGGC, GACTCTAATGGCTTGGACTGTC; 
CHRNB1: ACCAGATGCAGGAGAGAAGATG, GAGCGATGATGCAGGTTGAGG; 
CHRNB2: TGACCAGAGTGTGAGGGAGG, AGCTGCAAATGAGAGACCTCAG; 
CHRNB3: ACTTCATCAGTCAGGTTGTTCAAG, CTAGGTGGGATTCTCTCTATGTG; 
CHRNB4: ATCAGAGTGTCATCGAGGACTG, CACTAGGCTGCTCATATCATCC; 
CHRM1: GCCAAGGTGATGCCCTTACTC, TGCCTGTCACTGTAGCCAGAG; 
CHRM2: AGAGCCCTGAAGTCGCAGATC, CTCCCTGGATCTGGCTTTCAG; 
CHRM3: GGCTTCCTGGCATTGGTGAC, GCCAGAGGTCACAGGCTAAG; 
CHRM4: TGACTGGTTCCCTGAGCCTG, AGTAGCCCTTGATGATGTATAAGG; 
CHRM5: ACTATTACCTGCTCAGCTTGGC, GTAACGATCAAAGCTAATCACCAG. 

AgRP products were quantified by qPCR using the following primer pair: GCGGAGGTGCTAGATCCACAGAA and AGGACTCGTGCAGCCTTACAC. POMC products were quantified by qPCR using the following primer pair: AGAACGCCATCATCAAGAAC and AAGAGGCTAGAGGTCATCAG. Actin was used as a reference control using the following primer pair: GCAAGCAGGAGTACGATGAG and TAACAGTCCGCCTAGAAGCA. For quantitative transcript analysis, all reactions were done in triplicate with no-reverse-transcriptase negative control samples. Samples were prepared according to the BIO-RAD iQ SYBR Green Supermix instructions. Briefly, cDNA was diluted to 1 ng µl−1 and primers were diluted to a 2.5 µM stock of combined forward and reverse primers. Reactions were set up in 15 µl total volume per sample (7.5 µl SYBR Green Supermix, 5 µl cDNA, 1.5 µl forward and reverse primer mix, 1 µl water) in Applied Biosystems Advanced Studio 3 compatible 96-well plates. Plates were sealed with adhesive film and mixed thoroughly before amplification. Amplification occurred on the Applied Biosystems Advanced Studio 3 qPCR machine on its standard amplification protocol. All transcript analysis was standardized to the amplification curve of actin for each sample, and a Student's t-test was performed to analyse differences in transcript expression among samples. 
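
The text states only that transcript levels were standardized to actin. One common way to do this is the comparative Ct (2^−ΔΔCt) calculation sketched below. Treating it as the method used here is an assumption, as are the function name, the Ct values, and the premise of roughly 100% amplification efficiency for both primer pairs.

```python
from statistics import mean


def relative_expression(ct_target, ct_ref, ct_target_ctrl, ct_ref_ctrl):
    """Fold change of a target transcript versus a control sample.

    Each argument is a list of triplicate Ct values. The reference gene
    (for example actin) is used to normalize the target in each sample.
    Assumes ~100% amplification efficiency (a doubling per cycle).
    """
    delta_ct_sample = mean(ct_target) - mean(ct_ref)
    delta_ct_control = mean(ct_target_ctrl) - mean(ct_ref_ctrl)
    delta_delta_ct = delta_ct_sample - delta_ct_control
    return 2.0 ** (-delta_delta_ct)


# Hypothetical triplicate Ct values; not data from the study.
fold = relative_expression(
    ct_target=[26.0, 26.1, 25.9],       # target gene, experimental sample
    ct_ref=[18.0, 18.0, 18.0],          # reference gene, experimental sample
    ct_target_ctrl=[25.0, 25.0, 25.0],  # target gene, control sample
    ct_ref_ctrl=[18.0, 18.0, 18.0],     # reference gene, control sample
)
```

With these made-up values the target crosses threshold one cycle later in the experimental sample, which corresponds to about half the control expression.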

Statistical analyses. With the exception of electrophysiological-based experi- 
ments (previously described), all other statistical analyses were performed using 
GraphPad Prism 6 software (GraphPad), accounting for appropriate distribution 
and variance to ensure proper statistical parameters were applied. Experimental 
sample sizes were chosen according to minimal accepted norms within the field. 
With regards to experimental randomization for cell ablation, Chat conditional 
knockout, and optogenetic behaviour assays, mice were randomly separated into 
two groups before manipulation by an independent, blinded assistant not involved 
in experimentation or experimental design. For quantification of Chat conditional 
knockout in the DBB, DBB images were acquired by an independent assistant not 
involved in the experimentation and cell counts were then objectively tallied by a 
second assistant without knowledge of the experimental groups. Statistical methods 
used are described in figure legends for the respective experiments. 

Extended Data Figure 1 | Anatomy of the diagonal band of Broca. a-c, The DBB is situated in the basal forebrain and consists of an anteriorly located vertical limb (VDB), which branches posteriorly into separate bilateral horizontal limbs (HDB). d, AAV-FLEX-tdTomato (pseudocoloured in green) injected bilaterally into the HDB to show expression throughout the full extent of the diagonal band (VDB and HDBs). e-g, Bilateral viral injections into the HDB (AAV-FLEX-Syn::EGFP shown as an example) are sufficient to target the full extent of the diagonal band (VDB and HDB). Scale bars, 100 µm. Schematic in a was adapted from images from the Allen Brain Institute Reference Atlas. 

Extended Data Figure 2 | Time-dependent changes in body content and metabolic measures between non-ablated and DBB-ablated mice. a, Oxygen consumption, represented as VO2, at various stages post DBB-ablation. Data are represented as mean ± s.e.m. *P < 0.05 by two-sided, unpaired Student's t-test. b, c, Average total lean mass (b) or body fat (c) content at various stages post DBB-ablation. Data are represented as mean ± s.e.m. *P < 0.05 by two-sided, unpaired Student's t-test. d-g, Blood cholesterol (d), leptin (e), insulin (f), and blood glucose concentration (g) at various stages post DBB-ablation. Data are represented as mean ± s.e.m. *P < 0.05 by two-sided, unpaired Student's t-test. h, i, Representative non-ablated (h) or DBB-ablated (i) animals (n = 3 mice) dissected to show abdominal fat pads and enlarged fatty livers. j, k, Representative haematoxylin and eosin (H&E) stains showing fat deposition of the liver in non-ablated (j) or DBB-ablated (k) animals. Scale bar, 100 µm. 

Extended Data Figure 3 | Cholinergic DBB neurons are VGAT-positive. a-c, Vgat-cre+/− mouse injected with AAV-FLEX-tdTomato into the HDB and stained with anti-ChAT antibody (green channel). Data suggest a high percentage of co-localization between VGAT-positive neurons and ChAT. Scale bar, 100 µm. 

Extended Data Figure 4 | hM4D-mediated inhibition of the cholinergic DBB increases food intake. a, Whole-cell electrophysiological recordings from a representative cholinergic neuron expressing the hM4D inhibitory DREADD receptor before (top) and after (bottom) CNO treatment. b, Total two-hour food intake after an overnight fast (n = 11 mice). 'Saline' and 'CNO' groups represent separate trials of the same animals in the absence or presence of CNO, respectively. Data are represented as mean ± s.e.m. *P < 0.05 by two-sided, paired Student's t-test. 

Extended Data Figure 5 | Representative cholinergic DBB-innervating projection sites in the brain. a, b, Schematic (a) and representative image (b) showing injection site in the HDBs of a Chat-cre+/− mouse with AAV-FLEX-Syn::EGFP. Scale bar, 1 mm. c-e, Chat-cre+/− mouse injected bilaterally into the HDB with AAV-FLEX-Syn::EGFP and stained with an anti-ChAT antibody. Staining shows high co-localization of ChAT-positive neurons with the Cre-dependent AAV, demonstrating the efficacy of the Chat-cre mouse line. Of the neurons infected with virus, only a small fraction do not express ChAT. Scale bars, 100 µm (c-e) and 50 µm (c'-e'). f-h, Cholinergic projections from the DBB can be validated by known projection sites including the olfactory bulb. Scale bar, 200 µm; GL, glomerular layer; EPL, external plexiform layer; MCL, mitral cell layer; GCL, granule cell layer. i-k, Hippocampus. Scale bar, 200 µm. l-n, Amygdala (basolateral amygdala shown here). Scale bar, 10 µm. o-u, Also shown are areas of the ventral hypothalamus including median eminence (o-q), and arcuate nucleus (r-u). Scale bars, 100 µm. Schematic in a was adapted from an image from the Allen Brain Institute Reference Atlas. 

Extended Data Figure 6 | Pomc expression is reduced in pair-fed DBB-ablated animals. a, Relative Agrp and Pomc expression levels in arcuate nuclei between pair-fed non-ablated and DBB-ablated mice (n = 4 mice per group). Data are represented as mean ± s.e.m. *P < 0.05 by two-sided, unpaired Student's t-test. b, Acetylcholine does not significantly alter arcuate NPY neuron firing. Data shown as mean ± s.e.m. of spike frequency in 8 acetylcholine-treated cells (1 recorded neuron per slice, 1-2 slices per mouse, n = 6 mice); comparisons made versus baseline values (1-4 min) by repeated measures ANOVA with Holm-Sidak multiple comparison. c, d, Acetylcholine significantly increases arcuate POMC neuron firing. Data shown as mean ± s.e.m. of spike frequency in 7 acetylcholine-treated cells (1 recorded neuron per slice, 1-2 slices per mouse, n = 4 mice). *P < 0.001 versus baseline values (1-4 min) by repeated measures ANOVA with Holm-Sidak multiple comparison. e, Representative voltage clamp recording from POMC-EGFP neuron after local administration of acetylcholine and after pharmacological manipulation of fast synaptic transmission, as well as nicotinic acetylcholine receptor blockade (20-s sweep with an inter-trial interval of 1 min and repeated for 5 sweeps each for baseline, synaptic blockers (CNQX, 2-amino-5-phosphonopentanoic acid (APV), GABAzine and tetrodotoxin (TTX)), and nicotinic blockers (mecamylamine (MEC), methyllycaconitine (MLA) and dihydro-β-erythroidine hydrobromide (DHβE))). 

Extended Data Figure 7 | Diffuse cholinergic DBB projections into the hypothalamus. a, Site of AAV-FLEX-Syn::mRuby2 injections in the DBB. b-d, Representative images of cholinergic terminals (red) from the DBB into the hypothalamus. Scale bar, 100 µm. ME, median eminence. b'-d', Higher magnification images from the section shown in the dotted squares in b-d. Scale bar, 50 µm. Closed arrowheads indicate close apposition or co-localization between cholinergic terminals and POMC (green) cell bodies, whereas open arrowheads show apposition or co-localization on or near POMC neuronal processes. Of note, cholinergic terminals also appear on non-POMC neurons. e, f, DAPI (e), anti-β-endorphin (f) (POMC neuron marker). g, h, AAV-FLEX-Syn::mRuby2 (injected into the DBB) (g), and merged (h) channels from f and g show innervation of the hypothalamus from cholinergic neurons in the DBB. Scale bar, 200 µm. Schematic in a was adapted from an image from the Allen Brain Institute Reference Atlas. 

Extended Data Figure 8 | ChR2-mediated decrease in feeding is partially suppressed by nAChR blockade. a, b, Total (a) and normalized (with respect to the mean 'No-stim' value; b) food intake after a two-hour feeding after an overnight fast under specified conditions. Comparisons were made between separate trials conducted on the same animals (n = 5 mice). Data are represented as mean ± s.e.m. *P < 0.05 by two-sided, paired Student's t-test. 


[Figure: AChR expression profile; panels: nAChR subunits, mAChRs] 

Extended Data Figure 9 | Arcuate acetylcholine receptor (AChR) expression profile. a, Region-specific transcript expression profile of nicotinic and 
muscarinic AChRs in the arcuate nucleus, relative to the housekeeping gene, Gapdh (n = 4 mice). Data are represented as mean ± s.e.m. 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


doi:10.1038/nature19793 


Evidence for a limit to human lifespan 


Xiao Dong, Brandon Milholland & Jan Vijg 


Driven by technological progress, human life expectancy has 
increased greatly since the nineteenth century. Demographic 
evidence has revealed an ongoing reduction in old-age mortality and 
a rise of the maximum age at death, which may gradually extend 
human longevity. Together with observations that lifespan in 
various animal species is flexible and can be increased by genetic or 
pharmaceutical intervention, these results have led to suggestions 
that longevity may not be subject to strict, species-specific genetic 
constraints. Here, by analysing global demographic data, we show 
that improvements in survival with age tend to decline after age 100, 
and that the age at death of the world’s oldest person has not increased 
since the 1990s. Our results strongly suggest that the maximum 
lifespan of humans is fixed and subject to natural constraints. 

Maximum lifespan is, in contrast to average lifespan, generally 
assumed to be a stable characteristic of a species. For humans, the 
maximum reported age at death is generally set at 122 years, the 
age at death of Jeanne Calment, still the oldest documented human 
individual who ever lived. However, some evidence suggests that 
maximum lifespan is not fixed. Studies in model organisms have shown 
that maximum lifespan is flexible and can be affected by genetic and 
pharmacological interventions. In Sweden, based on a long series 
of reliable information on the upper limits of human lifespan, the 
maximum reported age at death was found to have risen from about 
101 years during the 1860s to about 108 years during the 1990s. 
According to the authors, this finding refutes the common assertion 
that human lifespan is fixed and unchanging over time. Indeed, the 
most convincing argument that the maximum lifespan of humans is not 
fixed is the ongoing increase in life expectancy in most countries over 
the course of the last century. Figure 1a shows this increase for France, 
a country with high-quality mortality data, but very similar patterns 
were found for most other developed nations (Extended Data Fig. 1). 
Hence, the possibility has been considered that mortality may decline 
further, breaking any pre-conceived boundaries of human lifespan. 
Figure 1 | Trends in life expectancy and late-life survival. a, Life expectancy at birth for the population in each given year. Life expectancy in France has increased over the course of the 20th and early 21st centuries. b, Regressions of the fraction of people surviving to old age demonstrate that survival has increased since 1900, but the rate of increase appears to be slower for ages over 100. c, Plotting the rate of change (coefficients resulting from regression of log-transformed data) reveals that gains in survival peak around 100 years of age and then rapidly decline. d, Relationship between calendar year and the age that experiences the most rapid gains in survival over the past 100 years. The age with most rapid gains has increased over the century, but its rise has been slowing and it appears to have reached a plateau. 

Figure 2 | Reported age at death of supercentenarians. All data were collected from the IDL database (France, Japan, UK and US, 1968-2006). a, The yearly maximum reported age at death (MRAD). The lines represent the functions of linear regressions. b, The annual 1st to 5th highest reported ages at death (RAD). The dashed lines are estimates of the RAD using cubic 

As shown by data from the Human Mortality Database, many 
of the historical gains in life expectancy have been attributed to a 
reduction in early-life mortality. More recent data, however, show 
evidence for a decline in late-life mortality, with the fraction of each 
birth cohort reaching old age increasing with calendar year. In France, 
the number of individuals per 100,000 surviving to old age (70 and up) 
has increased since 1900 (Fig. 1b), which points towards a continuing 
increase in human life expectancy. This pattern is very similar across 
the other 40 countries and territories included in the database 
(Extended Data Figs 2, 3). However, the rate of improvement in survival 
peaks and then declines for very old age levels (Fig. 1c), which points 
towards diminishing gains in reduction of late-life mortality and a 
possible limit to human lifespan. The same pattern was found across 
other developed, low-mortality countries (Extended Data Fig. 4). 
However, we considered the possibility that the age experiencing the 
greatest increase in survivorship increases with calendar years; that is, 
the peak in the rate of increase in survivorship to old age will shift to 
the right over time. To test this, we plotted the age at which this peak 
occurred against calendar years (Fig. 1d). The results indicate that the 
age with greatest improvement in survival appeared to plateau around 
1980. A similar pattern was seen in 88% of the 41 countries in the 
database (Extended Data Fig. 5). Together, these findings suggest, but 
do not prove, that human lifespan may have a natural limit. To further 
investigate this idea, we turned our attention from late-life mortality 
to maximum human lifespan itself and examined the ages at death of 
the world’s oldest individuals. 

We first plotted the yearly maximum reported age at death (MRAD) 
of France, Japan, UK and US, countries with the largest number of 
recorded supercentenarians (individuals aged 110 years old or more) 
in the International Database on Longevity (IDL; n = 534, 1968-2006). 
As shown in Fig. 2a, although age at death increased rapidly between the 
1970s and early 1990s, it reached a plateau around 1995, close to 1997, 
the year Jeanne Calment died. We partitioned the data into two groups 
(1968-1994 and 1995-2006) and modelled each group using linear 
regression. The results indicate a trend break between the two groups. 
Before 1995, the MRAD increased by 0.15 years per year (r = 0.68, 
P = 0.0007); however, after 1995 it no longer increased significantly 
and in fact decreased slightly by 0.28 years per year (r = −0.35, 
P = 0.27). When we considered MRAD records from another, 
independent resource, the Gerontological Research Group (GRG; 
http://www.grg.org/), we observed a similar trend: an increase by 
0.12 years per year (r = 0.71, P = 0.0002) during the period 1972-1994, 
followed by a slight decrease by 0.14 years per year (r = −0.36, P = 0.70) 
during the period 1995-2015 (Extended Data Fig. 6). These results 
indicate that although the MRAD increased until the 1990s, no further 
increases were observed after that time; human yearly MRAD has 
plateaued at 114.9 (95% CI: 113.1-116.7) years. To approximate the 
absolute limit of human lifespan, we modelled the MRAD as a Poisson 
distribution; we found that the probability of an MRAD exceeding 125 
in any given year is less than 1 in 10,000. 
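
The text does not say how the Poisson model was parameterized. One reading that reproduces the stated order of magnitude is to treat the number of years lived beyond the supercentenarian threshold of 110 as Poisson-distributed, with a mean equal to the plateau value minus 110. The sketch below rests on that assumption and is not the authors' documented procedure; the function name is hypothetical.

```python
import math


def poisson_upper_tail(k, lam):
    """Return P(X > k) for X ~ Poisson(lam), by summing the pmf up to k."""
    cdf = sum(
        math.exp(-lam) * lam ** i / math.factorial(i)
        for i in range(k + 1)
    )
    return 1.0 - cdf


# Assumption: model years ABOVE 110 as Poisson with mean 114.9 - 110.
threshold_age = 110
plateau_mrad = 114.9  # plateau value reported in the text
lam = plateau_mrad - threshold_age

# Probability that the MRAD exceeds 125, i.e. more than 15 years above 110.
p_exceed_125 = poisson_upper_tail(125 - threshold_age, lam)
```

Under this assumed parameterization the tail probability falls below 1 in 10,000, consistent with the figure quoted above. A Poisson fitted directly to the raw ages would have a much wider spread and would not give a tail this small.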

One potential confounder of our results is the fairly small number 
of reported MRAD cases, which could explain these results simply as 
fluctuations. To provide a robust statistical model that would strengthen 
the observed pattern, we considered several series of high reported age 
at death (HRAD), that is, the highest RAD (MRAD) and the second 
to the fifth highest RADs (Fig. 2b; data summarized from IDL). 
smoothing splines. The red dots represent the MRAD. c, Annual average 
age at death of supercentenarians (110 years plus, n = 534). The solid line is 
the estimate of the annual average age at death of supercentenarians, using a 
cubic smoothing spline. 


All series showed the same pattern as the MRAD. Notably, even 
the annual average age at death for these supercentenarians has not 
increased since 1968 (Fig. 2c). 

Hence, in contrast to previous suggestions that human longevity can 
be extended ever further, our data strongly suggest that the duration 
of life is limited. In the past, others have suggested that human lifespan 
is limited. For example, in 1980 Fries argued that increased prevention 
of premature deaths would lead to a compression of morbidity owing 
to a finite lifespan. However, his arguments for such a limit to life, that 
is, the lack of a detectable increase in centenarians or in the maximum 
reported age at death, while correct at that time, have been refuted 
since. Ten years later, Olshansky et al. estimated the upper limits to 
human longevity based on hypothetical reductions in mortality rates, 
concluding that life expectancy at birth would not exceed 85 years. 
Like Fries, Olshansky et al. also suggested a biological limit to life 
based on the lack of an increase in the age of the verified longest-lived 
individual. However, as they mention, insufficient data prevented them 
from drawing definite conclusions. Now, more than two decades later, 
such data are becoming available. With the caveat that the ages at death 
of the supercentenarians in our present study are still noisy and made 
up of small samples, we feel that the observed trajectories in Fig. 2 are 
compelling and our results strongly suggest that human lifespan has 
a natural limit. 

What could be the biological causes of this limit to human lifespan? 
The idea that ageing is a purposeful, programmed series of events that 
evolved under the direct force of natural selection to cause death has now 
been all but discredited. Instead, what appears to be a 'natural limit' is 
an inadvertent byproduct of fixed genetic programs for early life events, 
such as development, growth and reproduction. Limits to the duration 
of life could well be determined by a set of species-specific, longevity-assurance 
systems encoded in the genome that counteract these inadvertent 
byproducts, which are likely to include inherent imperfections 
in transferring genetic information into cellular function. To further 
extend human lifespan beyond the limits set by these longevity-assurance 
systems would require interventions beyond improving health span, 
some of which are currently under investigation. Although there is no 
scientific reason why such efforts could not be successful, the possibility 
is essentially constrained by the myriad of genetic variants that collectively 
determine species-specific lifespan. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper, references unique to 
these sections appear only in the online paper. 
Extended Data Figure 1 | Life expectancy over time since 1900 
(or the earliest year for which data was available) in 40 countries and 
territories. There is a generally positive trend over time; life expectancy 
in Japan appears to be reaching a plateau, but the increase looks unabated 
in many of the other countries. The data represent the entire population 
for each region, except Scotland, where it represents only the civilian 
population. The colour scheme is as in Fig. 1a. 

Extended Data Figure 4 | Rate of change in survival since 1900 (or the 
earliest year for which data was available) to a given age as a function 
of that age in 40 countries and territories. The rate of change is the 
slope of the line calculated by an exponential regression, that is, b in the 
equation y = a + bx, where x is age and y is the logarithm of the number 
of survivors to that age per 100,000. Including France, 90% (37/41) 
of the regions examined exhibited the pattern depicted in Fig. 1c. 

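The slope b described in this caption can be illustrated with a short sketch. It assumes the "exponential regression" is an ordinary least-squares fit of the logarithm of the survivor counts against x, and that the natural logarithm is used; the function name and the example numbers are hypothetical, not taken from the paper.

```python
import math


def log_linear_slope(xs, survivors_per_100k):
    """Return b from a least-squares fit of y = a + b*x, with y = ln(survivors).

    Assumption: natural logarithm and ordinary least squares; the caption
    does not state the base of the logarithm or the fitting routine.
    """
    if len(xs) != len(survivors_per_100k) or len(xs) < 2:
        raise ValueError("need two or more paired observations")
    ys = [math.log(s) for s in survivors_per_100k]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Hypothetical survivor counts that decay by 2% per unit of x.
xs = list(range(60, 100))
survivors = [50000 * math.exp(-0.02 * x) for x in xs]
b = log_linear_slope(xs, survivors)
```

Because the fit is done on the log scale, b is a relative (proportional) rate of change rather than an absolute one.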
Extended Data Figure 5 | Age with the greatest increase in survival as a 
function of calendar year in 40 countries and territories. For each year, 
the age with the greatest increase in survival over the past 100 years 
(or since the earliest year for which data was available), that is, the peak 
of a graph like that from Extended Data Fig. 4, was determined. Including 
France (Fig. 1d), a total of 82 data sets were considered (males and females 
in each region); we used linear regressions of segments of the data to look 
for evidence of plateaus. A data set was considered to be plateauing if one 
of the following criteria applied: the second half of the data had a negative 
slope; the first half of the data had a negative slope (as an increase in the 
second half would likely reflect a return to some equilibrium after being 
negatively perturbed); the first half of the data had a slope greater than that 
of the second half of the data; or the final 10% of the data had a slope less 
than that of the preceding 40%. In 88% (72/82) of the data sets, there was 
evidence of a plateau. 
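
The four plateau criteria listed in this caption can be sketched as a single predicate. This is an illustration under stated assumptions: the segments are fitted by ordinary least squares, the halves are split at the midpoint by index, and the 10% and 40% segments are rounded to whole data points. The caption does not specify any of these details, and the function names are hypothetical.

```python
def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


def is_plateauing(years, peak_ages):
    """Apply the four criteria from the caption to one data set."""
    n = len(years)
    if n != len(peak_ages) or n < 4:
        raise ValueError("need four or more paired observations")
    half = n // 2  # assumption: split by index at the midpoint
    first = ols_slope(years[:half], peak_ages[:half])
    second = ols_slope(years[half:], peak_ages[half:])
    tail_n = max(2, round(0.10 * n))  # final 10% of the data
    prev_n = max(2, round(0.40 * n))  # the preceding 40%
    start = n - tail_n - prev_n
    tail = ols_slope(years[n - tail_n:], peak_ages[n - tail_n:])
    prev = ols_slope(years[start:n - tail_n], peak_ages[start:n - tail_n])
    return second < 0 or first < 0 or first > second or tail < prev


# Hypothetical series: one that levels off and one that keeps accelerating.
years = list(range(1900, 2000))
levelling = [min(0.5 * t, 30.0) for t in range(100)]
accelerating = [t * t / 100.0 for t in range(100)]
```

Under these assumptions `is_plateauing(years, levelling)` is true because the first-half slope exceeds the second-half slope, whereas the accelerating series meets none of the criteria.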
Extended Data Figure 6 | The yearly maximum reported age at death 
from the GRG database (worldwide, 1972-2015). The lines represent the 
functions of linear regressions. 

The gradual accumulation of genetic mutations in human adult 
stem cells (ASCs) during life is associated with various age-related 
diseases, including cancer. Extreme variation in cancer risk across 
tissues was recently proposed to depend on the lifetime number 
of ASC divisions, owing to unavoidable random mutations that 
arise during DNA replication. However, the rates and patterns of 
mutations in normal ASCs remain unknown. Here we determine 
genome-wide mutation patterns in ASCs of the small intestine, 
colon and liver of human donors with ages ranging from 3 to 87 
years by sequencing clonal organoid cultures derived from primary 
multipotent cells. Our results show that mutations accumulate 
steadily over time in all of the assessed tissue types, at a rate of 
approximately 40 novel mutations per year, despite the large 
variation in cancer incidence among these tissues. Liver ASCs, 
however, have different mutation spectra compared to those of the 
colon and small intestine. Mutational signature analysis reveals 
that this difference can be attributed to spontaneous deamination 
of methylated cytosine residues in the colon and small intestine, 
probably reflecting their high ASC division rate. In liver, a signature 
with an as-yet-unknown underlying mechanism is predominant. 
Mutation spectra of driver genes in cancer show high similarity to 
the tissue-specific ASC mutation spectra, suggesting that intrinsic 
mutational processes in ASCs can initiate tumorigenesis. Notably, 
the inter-individual variation in mutation rate and spectra are low, 
suggesting tissue-specific activity of common mutational processes 
throughout life. 

It has not yet been possible to measure somatic mutation loads in 
ASCs from specific human tissues. However, such knowledge could 
be valuable in understanding tissue homeostasis and repair capacities 
as well as ASC vulnerabilities to extrinsic factors. The accumulation 
of mutations as life progresses is thought to underlie the genesis of 
age-related diseases such as cancer and organ failure. Mutations 
acquired in the genomes of multipotent ASCs are believed to have 
the largest impact on the mutational load of tissues, owing both to 
their potential for self-renewal and capacity to propagate mutations 
to their daughter cells. Consistently, cancer-initiating mutations 
in intestinal ASCs lead to tumour formation within weeks, whereas 
these mutations fail to drive intestinal adenomas when induced in 
differentiated cells. Unavoidable random mutations that arise during 
DNA replication in normal ASCs have recently been proposed to 
impart a large influence on cancer risk. Consequently, tissues with 
a high ASC turnover would show higher cancer incidence when 
compared to tissues with low ASC proliferation rates. However, 
computational modelling has suggested that the variation in ASC 
proliferation rate alone cannot exclude extrinsic risk factors as important 
determinants of organ-specific cancer incidence. Yet, the number 
of mutations that accumulate during the lifespan of normal human 
ASCs with different turnover rates has, to date, not been directly 
determined and compared. To understand tissue homeostasis and 
tissue-specific susceptibility to cancer and ageing-associated diseases 
it is important to assess mutation accumulation in ASCs of different 
tissues. 

Here, we experimentally define ASCs as those cells that give rise 
to long-term organoid cultures and have the potential to differentiate 
into multiple tissue-specific cell types. To catalogue the 
in vivo-acquired somatic mutations in individual normal human ASC 
genomes, we used an in vitro system to expand single ASCs into 
epithelial organoids, which reflect the genetic make-up of the original 
ASC (Extended Data Fig. 1a and Methods). This procedure allowed 
us to obtain sufficient DNA for accurate whole-genome sequencing 
(WGS) analysis, while circumventing the high noise levels associated 
with single-cell DNA amplification. We assessed ASCs from the 
small intestine, colon and liver, tissues that differ greatly in 
proliferation rate and cancer risk. Cancer incidence is much higher in the 
colon compared to the small intestine and liver. We sequenced 45 
independent clonal organoid cultures derived from 19 donors ranging 
in age from 3 to 87 years (Extended Data Table 1). In addition, 
we sequenced a blood or polyclonal biopsy sample of each donor to 
identify and exclude germline variants. Subclonal mutations, which 
must have been introduced in vitro after the single-cell step, were 
discarded based on their low variant-allele frequency (Extended Data 
Figs 1b-d, 2 and Methods). Overall, we identified 79,790 heterozygous 
clonal somatic point mutations and subsequent extensive validations 
showed an overall confirmation rate of approximately 91% (Extended 
Data Figs 1, 3). 

A positive correlation (t-test linear mixed model; P < 0.05) between 
the number of somatic point mutations and the age of the donor could 
be observed for all organs (Fig. 1a and Extended Data Fig. 4), indicating 
that ASCs gradually accumulate mutations with age, independent of 
tissue type. Notably, we found that the annual mutation rate in ASCs 
was in the same range for all assessed tissues, despite the dissimilar 
cancer incidence in these tissues; ASCs of the colon, small intestine 
and liver accumulate around 36 mutations per year (95% confidence 
intervals are 26.9-50.6, 25.8-43.6 and 11.9-60.1, respectively; Fig. 1b). 
The mutation spectra in small intestinal and colon ASCs were very 
similar, but differed markedly from liver (Fig. 1c). Notably, the mutation 
Figure 1 | Age-associated accumulation of somatic 
point mutations in human ASCs. a, Correlation of 
the number of somatic point mutations in each ASC 
type examined (extrapolated to the whole autosomal 
genome) with age of the donors per tissue. Each data 
point represents a single ASC. The P values of the age 
effects in the linear mixed model (two-tailed t-test) 
are indicated for each tissue. The sample sizes 
for colon, small intestine and liver ASCs are 6, 9 
and 5 donors, with, in total, 21, 14 and 10 ASCs, 
respectively. b, Somatic mutation accumulation rate 
per tissue as estimated by the linear mixed models in 
a. Error bars represent the 95% confidence intervals 
of the slope estimates. c, Relative contribution of 
the indicated mutation types to the point mutation 
spectrum for each tissue type. Data are represented as 
the mean relative contribution of each mutation type 
over all ASCs per tissue type (n = 21, 14 and 10 for 
colon, small intestine and liver, respectively) and error 
bars represent standard deviation. The total number 
of identified somatic point mutations per tissue is 
indicated. 
spectrum within tissues did not differ between young and elderly 
donors (Extended Data Fig. 5). 

Genome-wide mutation patterns in the ASCs provide insights into 
the mutational and DNA repair processes that are active in different 
organs. Using non-negative matrix factorization, we extracted 
three mutational process signatures (Fig. 2a and Methods). All of 
these signatures were previously described in a pan-cancer analysis. 
Signature A (corresponding to signature 5 in ref. 11), characterized 
by T:A to C:G transitions, was the main contributor to the mutation 
spectrum observed in the liver and was also clearly present in the 
small intestine and colon (Fig. 2). Although the underlying muta- 
tional process remains unknown, the number of mutations attributed 
Figure 2 | Signatures of mutational processes in human ASCs and 
their tissue-specific contribution. a, Contribution of context-dependent 
mutation types to the three mutational signatures that were identified by 
non-negative matrix factorization (NMF) analysis of the somatic mutation 
collection observed in the ASCs across all assessed tissues. The contribution 
of each trinucleotide (order is similar to that in ref. 11) to each signature 
is shown. For each signature, the presence of transcriptional-strand bias, 
DNA-replication-timing bias and/or association with the culture system, 
to this signature that accumulate with age resembles a linear trend 
in all tissues (Fig. 2b). This suggests that this signature represents a 
universal genomic ageing mechanism (that is, a chemical process 
acting on DNA molecules) independent of cellular function or pro- 
liferation rate. 

The majority of the somatic mutations observed in small intestinal 
and colon ASCs could be attributed to signature B (corresponding 
to signature 1A in ref. 11), which is characteristic of spontaneous 
deamination of methylated cytosine residues into thymine at CpG sites 
(Fig. 2a). The resulting T:G mismatch can be effectively repaired, but 
the mutation is incorporated if DNA replication occurs before the 
repair is initiated. In line with this, high rates of signature B mutations 
is indicated. b, Absolute contribution of each mutational signature type 
(extrapolated to the whole autosomal genome) plotted against the age of the 
donors for each tissue. Each data point represents a single ASC. The P values 
of the age effects per tissue are shown (linear mixed model, two-tailed t-test). 
c, Signature-specific mutation rate per year per genome for each tissue as 
estimated by the linear mixed model in b. Error bars represent the 95% 
confidence intervals of the slope estimates. 
Figure 3 | Non-random genomic distribution of somatic point 
mutations in ASCs. a, Enrichment and depletion of somatic point 
mutations in the indicated genomic regions for each tissue. The log 
ratio of the number of observed and expected point mutations indicates 
the effect size of the enrichment or depletion in each region. *P < 0.05, 


are observed in many cancer types of epithelial origin with high cell 
turnover. This process showed a minimal contribution to the 
age-related mutational load in liver ASCs (Fig. 2c), which is likely to 
reflect the relatively low division rate of these cells during life. Finally, 
contribution of a third signature, signature C (corresponding to 
signature 18 in ref. 11), was minimal in all tissues and did not corre- 
late with age (Fig. 2b). Sequential clonal ASC expansions in culture 
followed by WGS analysis showed that in vitro-induced mutations are 
predominantly characterized by this signature (Extended Data Fig. 6 
and Methods). 

Signature B mutations were strongly associated with the timing 
of replication and predominantly present in late-replicating DNA 
(Extended Data Fig. 7) even though the majority of CpG dinucleotides 
are located in early-replicating DNA. This bias suggests that 
this mutagenic process is more active in late-replicating DNA or, 
alternatively, that replication-coupled repair shows reduced activity 
in late-replicating DNA. Consequently, somatic mutations 
in small intestine and colon ASCs were strongly enriched in 
late-replicating DNA and depleted in early-replicating DNA (Fig. 3a, b). In 
addition, somatic point mutations in small intestine and colon ASCs 
were depleted in H3K27ac (histone H3 acetyl Lys27)-associated DNA 
and enriched in H3K9me3 (histone H3 trimethyl Lys9)-associated 
DNA (Fig. 3a), similar to patterns previously observed in cancer. 
As genic regions are predominantly located in early-replicating DNA 
and open chromatin, we observed a depletion of mutations in exonic 
sequences (Fig. 3a). This demonstrates that genome-wide mutation 
rates and spectra cannot be reliably estimated using mutation 
discovery in reporter genes, such as the T-lymphocyte HPRT cloning 
assay, or by deep sequencing of genic regions. To test whether 
the depletion of coding mutations was caused by selection against 
cells with damaging mutations, we calculated the ratio of 
non-synonymous to synonymous mutations (dN/dS) taking into account 
the mutation spectra and sequence composition (see Methods). We 
did not observe negative selection for non-synonymous mutations 
one-sided binomial test. b, Distribution of DNA replication timing for all 
genomic positions and the somatic point mutations detected in human 
ASCs per tissue. c, Relative contribution of each point-mutation type on 
the transcribed and untranscribed strand for each tissue. *P < 0.05, 
two-sided Poisson test. 


(Extended Data Fig. 7f), arguing against the negative selection of cells 
with damaging protein-coding mutations. 

In liver ASCs, somatic mutations are more randomly distributed 
throughout the genome and are less associated with replication timing 
or chromatin status (Fig. 3a). Nevertheless, a comparable depletion of 
exonic mutations was observed in all tissues (Fig. 3a), suggesting that 
liver ASCs use different mechanisms to maintain genetic integrity in 
functionally relevant regions. Signature A, the most predominant in 
liver ASCs, shows little bias towards DNA-replication-timing dynamics, 
but a pronounced transcriptional-strand bias (Extended Data Fig. 7), 
consistent with activity of transcription-coupled repair. In line with 
this, point mutations in the genic regions of the assessed ASCs showed 
a significant transcriptional strand bias, exemplified by the more fre- 
quent occurrence of T:A to C:G transitions on the transcribed strand 
compared to the untranscribed strand (Fig. 3c). 

Our results indicate that a stable balance between the degree of DNA 
damage and the subsequent repair is maintained throughout life in 
various ASC types, since mutations accumulate steadily and display a 
constant mutation spectrum. Earlier work in mice, using mutation 
discovery in a LacZ reporter gene, showed major age-related changes 
in mutation spectra in different tissues. The difference between these 
observations could be explained by the comprehensive genome-wide 
analysis applied here to ASCs, whereas reporter assays assess specific 
genes predominantly in differentiated cells. Although variation in 
tissue-specific mutation spectra in mice has been reported previously, we 
observed a difference in both mutation rate and spectrum in human 
cells (Extended Data Fig. 8). This indicates that mutation data derived 
from mice are not necessarily suitable for interpreting mutational pro- 
cesses and their consequences in humans. 

Although we analysed cells from many different donors without 
controlling for lifestyle differences or gender, the point-mutation rate 
and spectrum were highly similar between individuals within organs. 
This suggests that incidental exposure to environmental mutagenic 
factors has minimal effect on the point-mutation landscapes in 
Figure 4 | Cancer-associated mutation spectra in driver genes and 
structural variation in normal ASCs. a, Spectrum of point mutations 
in cancer driver genes APC, TP53, SMAD4 and CTNNB1 identified in 
colorectal and liver cancer. The total number of somatic point mutations 
per gene per cancer type is indicated. b, Read-depth analysis indicating 
a relatively small deletion (~90 kb) located within a common fragile site 
(FRA2I) in intestinal ASC 11-a. Each point represents the log2 value of the 
GC-corrected read-depth ratio per 5-kb window. Dashed lines indicate 
breakpoint regions; a schematic representation of the identified structural 
variant with associated genomic and breakpoint features is depicted below. 


normal ASCs of the organs we assessed. Cell-intrinsic mutational 
processes, such as deamination-induced mutagenesis in rapidly 
cycling ASCs, seem to be more important determinants of point- 
mutation load. Indeed, many colorectal cancer mutations in the driver 
genes APC, TP53, SMAD4 and CTNNB1 are C:G to T:A transitions 
at CpG dinucleotides, whereas liver cancer driver mutations in the 
same genes have a completely different spectrum (Fig. 4a). However, 
ASCs of the colon and small intestine show very similar age-related 
mutation characteristics, although cancer incidence is extremely 
low in the human small intestine. In addition to somatic point 
mutations, we evaluated the presence of somatic structural variants 
(Fig. 4b-e and Extended Data Table 2). We detected small deletions 
(91-443 kb) in 3 out of 14 small intestinal ASCs and a larger deletion 
(2 Mb) in one ASC. Notably, colon ASCs showed complex and larger 
chromosomal instability in 4 out of 15 colon ASCs, including a 
complex translocation (Fig. 4d) and a trisomy (Fig. 4e). These events are 
characteristic of segregation errors that can occur during cell division, 
and are a hallmark of many colorectal cancers. In addition, other 
factors, such as tissue clonality or external agents may also contrib- 
ute to the difference in cancer incidence between colon and small 
intestine. 

Here we have shown that ASCs of organs with different cancer inci- 
dences gradually accumulate mutations at similar rates, but that the 


c, Two large (>1 Mb) tandem duplications identified in liver ASC 14-a 
with microhomology at the breakpoints; duplications are indicated in the 
schematic representation of the identified structural variants below the 
graph. d, A complex structural variation (an unbalanced translocation 
involving 3 chromosomes) identified in colon ASCs 4-b and 4-e. Coloured 
lines in the schematic below show the predicted derivative chromosomes. 
e, Read-depth analysis indicating a trisomy of chromosome 13 in colon 
ASC 3-c. Each data point represents the median chromosome copy 
number per 500-kb bin plotted over the genome, with alternating colours 
for each successive chromosome. 


mutation profiles are tissue-specific. In the ASCs of the tissues assessed 
here, mutation accumulation is primarily driven by a combination of 
proliferation-dependent mutation incorporation following spontane- 
ous deamination of methylated cytosine residues and another process 
with a currently unknown underlying molecular mechanism. Notably, 
the former intrinsic, unavoidable mutational process can cause the 
same types of mutation as those observed in cancer driver genes. We 
have shown that, at least in colon ASCs, this class of mutations could 
have a role in driving tumorigenesis. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper, references unique to 
these sections appear only in the online paper. 
METHODS 

No sample-size estimate was calculated before the study was executed. The 
experiments were not randomized. The investigators were not blinded to allocation 
during experiments and outcome assessment. 

Human tissue material. Endoscopic, colorectal and duodenal biopsy samples 
were obtained from individuals of different ages that had been admitted for 
suspected inflammation. One individual (donor 1) showed no inflammation during 
colonoscopy, but was later diagnosed with microscopic colitis. The other 
individuals were found to be healthy based on standard histological examination. 
Endoscopic biopsies were performed at the University Medical Center Utrecht and 
the Wilhelmina Children's Hospital. The patients’ informed consent was obtained 
and this study was approved by the ethical committee of University Medical Center 
Utrecht. Additionally, normal tissue was isolated from resected colon segments, 
at >5 cm distance from a tumour in three colorectal cancer patients (donors 3, 4 
and 19). The colonic tissues were obtained at The Diakonessen Hospital Utrecht 
with informed consent and the study was approved by the ethical committee. Liver 
biopsies (0.5-1 cm3) were obtained from donor livers during transplantations 
performed at the Erasmus Medical Center, Rotterdam. Both liver and colon biopsies 
were obtained from donor 18. The Medical Ethical Council of the Erasmus MC 
approved the use of this material for research purposes, and informed consent was 
provided by all donors and/or relatives. 

Establishment of clonal ASC cultures. Dissociated colon and small intestinal 
crypts were isolated from the biopsies and cultured for 1-2 weeks under conditions 
that are optimal for stem-cell proliferation, as previously described. Liver cells 
were isolated from human liver biopsies and cultured as previously described. 
From these cultures, single cells were sorted by flow cytometry and clonally 
expanded (Extended Data Fig. 1a). Clonal ASC cultures were subsequently 
established by manual picking of individual organoids derived from single cells and 
in vitro expansion for a period of ~6 weeks. 

Whole-genome sequencing and read alignment. DNA libraries for Illumina 
sequencing were generated using standard protocols (Illumina) from 200 ng to 1 µg 
of genomic DNA isolated from the clonally expanded ASC cultures with genomic 
tips (Qiagen). The libraries were sequenced with paired-end (2 × 100 bp) runs 
using Illumina HiSeq 2500 sequencers to a minimal depth of 30× base coverage. 
Samples of donors 1, 2, 3, 4, 10, 12, 13, 15, 16, 18 and 19 were sequenced using 
Illumina HiSeq X Ten sequencers to equal depth. The reference samples, blood 
or biopsy, were sequenced similarly. Sequence reads were mapped against human 
reference genome GRCh37 using Burrows-Wheeler Aligner v0.5.9 mapping tool 
with settings ‘bwa mem -c 100 -M’. Sequence reads were marked for duplicates 
using Sambamba v0.4.7 (ref. 28) and realigned per donor using Genome Analysis 
Toolkit (GATK) IndelRealigner v2.7.2 and sequence read-quality scores were 
recalibrated with GATK BaseRecalibrator v2.7.2. Alignments from different 
libraries of the same ASC culture were combined into a single BAM file. 

Point mutation calling. Raw variants were multi-sample (per donor) called using 
the GATK UnifiedGenotyper v2.7.2 (ref. 29) and GATK-Queue v2.7.2 with default 
settings and additional option ‘EMIT_ALL_CONFIDENT_SITES’. The quality 
of variant and reference positions was evaluated using GATK VariantFiltration 
v2.7.2 with options ‘-filterExpression “MQ0 > 4 && ((MQ0 / (1.0 * DP)) 
> 0.1)” -filterName “HARD_TO_VALIDATE” -filterExpression “QUAL < 30.0” 
-filterName “VeryLowQual” -filterExpression “QUAL > 30.0 && QUAL < 50.0” 
-filterName “LowQual” -filterExpression “QD < 1.5” -filterName “LowQD”’. 

Point mutation filtering. To obtain high-quality catalogues of somatic point 
mutations, we applied a comprehensive filtering procedure (Extended Data 
Fig. 1b). We considered variants that were passed by VariantFiltration and had 
a GATK phred-scaled quality score >100. Subsequently, for each ASC culture, 
we considered the positions with a base coverage of at least 20× in both the 
culture and the reference sample (blood or biopsy). Furthermore, we only regarded 
variants at autosomal chromosomes. We excluded variant positions that 
overlapped with single-nucleotide polymorphisms (SNPs) in the SNP database 
(dbSNP) v137.b37 (ref. 30). Furthermore, we excluded all positions that were 
found to be variable in at least two of three unrelated individuals (that is, donor 5, 6 
and X (not in study)) to exclude recurrent sequencing artefacts. To obtain somatic 
point mutations, we filtered out all variants with any evidence of the alternative 
allele in the reference sample. We validated the clonal origin of the sequenced ASC 
cultures by analysing the variant allele frequencies (VAFs) of the somatic 
mutations. Two cultures (donor 14, cell b and donor 17, cell c) showed a shift in the peak 
of the somatic heterozygous mutations to the left, indicating that they did not arise 
from a single stem cell, and were therefore excluded from the analysis (Extended 
Data Fig. 2). Finally, for all cultures we excluded point mutations with a VAF < 0.3 
to exclude mutations that were potentially induced in vitro after the (first) clonal 
step (Extended Data Fig. 1b-d). The number of mutations that passed each 
filtering step for the samples of donor 5 and 6 is depicted in Extended Data Fig. 1e. 
The overlap of the point mutations between ASCs of the same donor is depicted 
in Extended Data Fig. 4d. 
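
The filtering cascade described above can be sketched as one predicate over a variant record. This is a hedged illustration, not the authors' pipeline: the `Variant` record, its field names and the `is_clonal_somatic` function are hypothetical, the quality cut-off is taken as strictly greater than 100 as printed, and the coverage and VAF thresholds are treated as inclusive ("at least 20×", exclude "VAF < 0.3").

```python
from dataclasses import dataclass

# Chromosome naming without a "chr" prefix is an assumption.
AUTOSOMES = {str(i) for i in range(1, 23)}


@dataclass
class Variant:
    """Hypothetical per-variant summary for one ASC culture."""
    chrom: str
    qual: float                 # GATK phred-scaled quality score
    passed_filtration: bool     # passed GATK VariantFiltration
    depth_culture: int          # base coverage in the clonal culture
    depth_reference: int        # base coverage in blood or biopsy
    alt_reads_reference: int    # reads supporting the alternative allele
    vaf: float                  # variant allele frequency in the culture
    in_dbsnp: bool              # overlaps a dbSNP position
    recurrent_artefact: bool    # variable in two or more of the three
                                # unrelated individuals


def is_clonal_somatic(v: Variant) -> bool:
    """Return True if the variant survives every filter in the text."""
    return (
        v.passed_filtration
        and v.qual > 100
        and v.depth_culture >= 20
        and v.depth_reference >= 20
        and v.chrom in AUTOSOMES
        and not v.in_dbsnp
        and not v.recurrent_artefact
        and v.alt_reads_reference == 0   # no evidence in the reference
        and v.vaf >= 0.3                 # drop likely in vitro subclonal calls
    )


good = Variant("7", 250.0, True, 34, 31, 0, 0.48, False, False)
subclonal = Variant("7", 250.0, True, 34, 31, 0, 0.12, False, False)
sex_linked = Variant("X", 250.0, True, 34, 31, 0, 0.48, False, False)
```

Writing the filters as a single conjunction makes the order irrelevant for the final call set; the per-step counts reported in the Extended Data would require applying the conditions one at a time.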

Validations of point mutations. We evaluated our mutation filtering procedure 
by independent validations of 374 pre-selected positions that were either discarded 
or passed during filtering using amplicon-based next-generation sequencing. 
To this end, primers were designed ~250 nucleotides 5′ and 3′ from the candidate 
point mutations to obtain amplicons of ~500 bp (primer sequences available upon 
request). These regions were PCR-amplified for both the organoid cultures and 
reference samples of donor 5 and 6, using 5 ng genomic DNA, 1× PCR Gold 
Buffer (Life Technologies), 1.5 mM MgCl2, 0.2 mM of each dNTP and 1 unit of 
AmpliTaq Gold (Life Technologies) in a final volume of 10 µl. The reaction was 
held at 94 °C for 60 s followed by 15 cycles at 92 °C for 30 s, 65 °C for 30 s (with a 
decrement of 0.2 °C per cycle) and 72 °C for 60 s; followed by 30 cycles of 92 °C 
for 30 s, 58 °C for 30 s and 72 °C for 60 s; with a final extension at 72 °C for 180 s. 

The PCR products were pooled and barcoded per culture. Illumina sequence 
libraries were generated according to the manufacturer's protocol. Subsequently, 
the libraries were pooled and sequenced using the MiSeq platform (2 × 250 bp) 
to an average depth of ~100×. Alignment and variant-calling was performed as 
described above. For each ASC we evaluated those positions with at least 20× 
coverage for both culture and reference sample, and defined positive positions as 
those with a call in culture, with a VAF > 0.3 and no call in the reference sample. 

Subsequently, we determined the number of confirmed negatives of the positions 
that were filtered out for each filter step (Extended Data Fig. 1d). Moreover, we 
determined the number of confirmed positives of the positions that passed all 
filters (Extended Data Fig. 1e, f). 

Assessment of effects of in vitro culturing on ASC mutation load. We expanded 
10 initial clonal organoid cultures from small intestine and liver for a further 3-5 
months (equivalent to ~20 weekly passages), upon which we isolated single 
cells and subjected them to clonal expansion to obtain sufficient DNA for WGS 
(Extended Data Fig. 6a). This approach allowed us to catalogue the mutations that 
accumulated in single ASCs during the culturing period between the two clonal 
steps. To this end, we selected the somatic point mutations that were unique to the 
subclonal cultures and not present in the corresponding original clonal cultures 
and therefore acquired during the in vitro expansion. We evaluated the 
specificity of our mutation-discovery procedure by determining the confirmation rate 
of the mutations identified in the original clone in the corresponding subclone. 
Only positions that had a coverage of >20× in both the original clonal and 
corresponding subclonal culture as well as in the reference sample were evaluated. On 
average, 91.19 ± 4.87% (mean ± s.d.) of these point mutations were confirmed in 
the subclonal cultures (Extended Data Fig. 3). 

Correlation between ASC somatic point mutation accumulation and age. The 
surveyed area per ASC was calculated as the number of positions with coverage >20× in 
both culture and the reference sample. The percentage of the whole non-N autosomal 
genome (GRCh37: 2,682,655,440 bp) that is surveyed in each ASC is depicted 
in Extended Data Table 1. For each ASC the total number of identified somatic 
point mutations was extrapolated to the whole non-N autosomal genome using its 
surveyed area. Subsequently, a linear mixed-effects regression model was fitted to 
estimate the effect of age on the number of somatic point mutations for each tissue 
using the nlme R package, in which ‘donor’ is modelled as a random effect to 
resolve the non-independence that results from having multiple measurements per 
donor. A two-tailed t-test was performed to test whether the slope is significantly 
different from zero (that is to say, whether the fixed age effect in the linear mixed 
model is statistically significant). The intercept of the regression lines with the 
y axis represents the somatic mutations present at birth (that have accumulated in 
the tissue lineage during prenatal development) plus the noise levels in the data 
and the mutations that have accumulated during the first week(s) of culturing 
preceding the clonal step (see above). Since all cells were assessed in a similar 
manner, noise levels will be comparable and therefore will not bias the mutation 
rate (slope) estimates. The slope of the regression line was used to estimate the 
fixed age effect on somatic point mutation rate per tissue. 
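
The extrapolation step in this paragraph is a simple rescaling by the surveyed fraction of the genome. The sketch below assumes a linear scaling, which is the natural reading of "extrapolated ... using its surveyed area"; the function name and the example counts are hypothetical. It does not reproduce the linear mixed-effects model, which the authors fitted with the nlme R package.

```python
# Size of the non-N autosomal genome (GRCh37) as given in the text.
NON_N_AUTOSOMAL_BP = 2_682_655_440


def extrapolate_mutations(observed, surveyed_bp, genome_bp=NON_N_AUTOSOMAL_BP):
    """Scale an observed mutation count to the whole non-N autosomal genome.

    Assumption: mutations are spread proportionally, so the genome-wide
    count is observed / (surveyed fraction of the genome).
    """
    if not 0 < surveyed_bp <= genome_bp:
        raise ValueError("surveyed_bp must be in (0, genome_bp]")
    return observed * genome_bp / surveyed_bp


# Hypothetical ASC with 1,000 calls in a surveyed area of half the genome.
estimate = extrapolate_mutations(1000, NON_N_AUTOSOMAL_BP // 2)
```

The extrapolated counts, one per ASC, are the response values that the mixed model then regresses on donor age.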

To exclude the possibility that differences in surveyed areas between ASCs bias
our results, we performed the age correlation and spectrum analyses on a subset
of mutations that are located in genomic regions that are surveyed (>20×) in all
samples in this study. This consensus surveyed area comprises 38.2% of the autosomal
non-N genome and both the mutation rate and spectra were highly similar
to those in Fig. 1c (Extended Data Fig. 4a-c), indicating that the differences in
surveyed areas between the clones do not bias our conclusions.

Definition of genomic regions. To generate a conserved DNA replication timing
profile for the human genome, we downloaded 16 Repli-seq data sets from the
ENCODE project (ref. 33) at the University of California, Santa Cruz (UCSC) genome
browser (ref. 34; GRCh37/hg19). The data consisted of wavelet-smoothed values per
1-kb bin throughout the genome for 15 different cell lines (BJ, BG02ES, GM06990,
GM12801, GM12812, GM12813, GM12878, HeLa-S3, HepG2, HUVEC, IMR90,
K562, MCF-7, NHEK and SK-N-SH). We considered the median values of all
cell lines per bin, thereby excluding cell-specific values. We arbitrarily divided
the genome into early- (≥60), intermediate- (>33 and <60) and late- (≤33)
replicating bins (Fig. 3b). To generate a conserved chromatin-association profile
for the human genome, we downloaded data containing the H3K9me3 signal
per 25-nucleotide bin throughout the genome for 22 different cell lines (A549,
AG04450, DND41, GM12878, H1-hESC, HeLa-S3, HepG2, HMEC, HSMM,
HSMMt, HUVEC, K562, monocytes-CD14+_RO01746, NH-A, NHDF-Ad,
NHEK, NHLF, osteoblasts, MCF-7, NT2-D1, PBMC and U2OS) and the H3K27ac
signal for 9 different cell lines (CD20+_RO01794, DND41, H1-hESC, HeLa-S3,
HSMM, monocytes-CD14+_RO01746, NH-A, NHDF and osteoblasts). Data were
downloaded from the ENCODE project (ref. 33) at the UCSC browser (ref. 34;
GRCh37/hg19) and the median values of all cell lines per bin were calculated. Next,
we determined the distribution of the fractions of all bins (genome-wide). According
to the shape of the resulting graph, we considered bins with an H3K9me3 value >4,
or an H3K27ac value >2, as associated with that chromatin mark. Finally,
exonic sequences were defined as all exonic regions reported in Ensembl v75
(GRCh37; ref. 35).
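
As a rough illustration of the replication-timing binning, the sketch below takes the per-cell-line values for one 1-kb bin, computes their median and assigns a timing class. It assumes the cut-offs read as at least 60 for early, between 33 and 60 for intermediate, and at most 33 for late; the function and constant names are hypothetical.

```python
from statistics import median

# Assumed cut-offs on the median wavelet-smoothed Repli-seq value per bin.
EARLY_MIN = 60.0  # median >= 60 is treated as early replicating
LATE_MAX = 33.0   # median <= 33 is treated as late replicating


def classify_bin(values_per_cell_line):
    """Return the replication-timing class of one bin.

    values_per_cell_line: Repli-seq values for this bin, one per cell line.
    """
    if not values_per_cell_line:
        raise ValueError("at least one cell-line value is required")
    m = median(values_per_cell_line)
    if m >= EARLY_MIN:
        return "early"
    if m <= LATE_MAX:
        return "late"
    return "intermediate"


# Hypothetical bins, three cell lines each.
for values in ([70, 65, 58], [50, 30, 45], [10, 40, 20]):
    print(values, "->", classify_bin(values))
```

Taking the median across cell lines is what makes the profile "conserved": a single outlying cell line cannot move a bin into another class.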

Enrichment or depletion of point mutations in genomic regions. We determined
whether somatic point mutations were enriched or depleted in the genomic regions
described above. To this end, we determined how many point mutations were
observed in each genomic region for each donor. Next, we calculated the number
of bases that were surveyed in each genomic region and calculated the expected
number of point mutations by multiplying this surveyed length with the genome-wide
point-mutation frequency. The log2(observed/expected) of the mutations in
the genomic regions was used as a measure of the effect size of the depletion or
enrichment. One-tailed binomial tests were performed to calculate the statistical
significance of deviations from the expected number of mutations in the genomic
regions using pbinom (ref. 31); P < 0.05 was considered significant.
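
The enrichment calculation can be sketched in a few lines. This is a plain-Python stand-in for the R pbinom call, under the assumption that the number of trials is the surveyed length of the region and the success probability is the genome-wide point-mutation frequency; all function names are illustrative.

```python
import math


def expected_mutations(surveyed_bp: int, genome_wide_rate: float) -> float:
    """Expected count = surveyed length x genome-wide mutation frequency."""
    return surveyed_bp * genome_wide_rate


def log2_ratio(observed: int, expected: float) -> float:
    """Effect size; negative means depletion, positive means enrichment."""
    if observed == 0:
        return float("-inf")
    return math.log2(observed / expected)


def binom_cdf(k: int, n: int, p: float) -> float:
    """P(X <= k) for X ~ Binomial(n, p), with each term computed in log space."""
    if not 0 < p < 1:
        raise ValueError("p must be strictly between 0 and 1")
    if k < 0:
        return 0.0
    if k >= n:
        return 1.0
    log_p, log_q = math.log(p), math.log1p(-p)
    total = 0.0
    for i in range(k + 1):
        log_pmf = (math.lgamma(n + 1) - math.lgamma(i + 1)
                   - math.lgamma(n - i + 1) + i * log_p + (n - i) * log_q)
        total += math.exp(log_pmf)
    return min(total, 1.0)


def one_tailed_p(observed: int, n: int, p: float) -> float:
    """Lower tail if fewer mutations than expected, upper tail otherwise."""
    if observed < n * p:
        return binom_cdf(observed, n, p)        # test for depletion
    return 1.0 - binom_cdf(observed - 1, n, p)  # test for enrichment
```

For a region with hypothetical values `n` surveyed bases and `k` observed mutations, `log2_ratio(k, expected_mutations(n, rate))` gives the effect size and `one_tailed_p(k, n, rate)` the P value to compare against 0.05.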

Mutational signatures. The occurrences of all 96-trinucleotide changes were
counted for each ASC and averaged per donor. Three mutational signatures were
extracted using NMF (ref. 36). To determine the replication bias of signatures, we
determined whether the point mutations were located in an intermediate, early or late
replicating region (as defined above) using GenomicRanges (ref. 37) and repeated the
NMF on a 288 count matrix (96 trinucleotides × 3 replication timing regions).
Similarly, we looked at transcriptional strand bias by performing NMF on a 192
count matrix (96 trinucleotides × 2 strands). To this end, we selected all point
mutations that fall within gene bodies and checked whether the mutated C or T was
located on the transcribed or non-transcribed strand. We defined the transcribed
units of all protein coding genes based on Ensembl v75 (GRCh37) and included
introns and untranslated regions.
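
To show where the 96, 288 and 192 count-matrix sizes come from, the sketch below enumerates the 96 trinucleotide changes and assigns a mutation to its channel. It assumes the common convention of reporting every change from the pyrimidine (C or T) strand; the channel label format and function names are illustrative and are not taken from the authors' package.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")
SUBSTITUTIONS = ["C>A", "C>G", "C>T", "T>A", "T>C", "T>G"]

# 6 substitution types x 4 bases 5' x 4 bases 3' = 96 channels.
ALL_CHANNELS = [f"{five}[{sub}]{three}"
                for sub in SUBSTITUTIONS
                for five, three in product("ACGT", repeat=2)]


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


def channel(context: str, alt: str) -> str:
    """Map a reference trinucleotide (mutated base in the middle) and the
    alternative base to one of the 96 pyrimidine-centred channels."""
    context, alt = context.upper(), alt.upper()
    if len(context) != 3 or alt not in "ACGT" or alt == context[1]:
        raise ValueError("need a 3-base context and a different alt base")
    if context[1] in "AG":  # purine reference: switch to the other strand
        context = reverse_complement(context)
        alt = alt.translate(COMPLEMENT)
    return f"{context[0]}[{context[1]}>{alt}]{context[2]}"


def count_channels(mutations):
    """Count (context, alt) pairs per channel; this is one row of the matrix."""
    counts = dict.fromkeys(ALL_CHANNELS, 0)
    for context, alt in mutations:
        counts[channel(context, alt)] += 1
    return counts
```

Splitting each channel by three replication-timing classes gives 96 × 3 = 288 columns, and splitting by transcribed versus non-transcribed strand gives 96 × 2 = 192.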

Selection analysis (dN/dS). The dN/dS ratio was determined as described
previously. In brief, we used 192 rates, one for each of the possible trinucleotide
changes in both strands. For each substitution type, we counted the number of
potential synonymous and non-synonymous mutations in the protein-coding
sequences of the human genome, using the longest DNA coding sequence as the
reference sequence for each gene. Poisson regression was used to obtain
maximum-likelihood estimates and confidence intervals of the normalized ratio of
non-synonymous versus synonymous mutations (dN/dS ratio). The dN/dS ratio
was tested against neutrality (dN/dS = 1) using a likelihood-ratio test.

Comparison of mouse and human intestinal ASCs mutation loads. Intestinal
ASCs were isolated from the proximal part of the small intestine of randomly
chosen ~2-year-old mice (one male and one female) carrying the
Lgr5-EGFP-Ires-CreERT2 allele (mice were C57BL/6 background) by sorting for
GFP-positive cells. Subsequently, three Lgr5-positive cells per animal were clonally
expanded as described. All experiments were approved by the Animal Care Committee of
the Royal Dutch Academy of Sciences according to the Dutch legal ethical
guidelines. DNA isolated from the intestinal ASC cultures of mouse 1 was
sequenced with paired-end (75 and 35 bp) runs using SOLiD 5500 sequencers
(Life Technologies) to an average depth of ~18× base coverage. Intestinal ASC
cultures of mouse 2 were sequenced using Illumina HiSeq 2500 sequencers as
described above. Sequence reads were aligned using Burrows-Wheeler Aligner to
the mouse reference genome (NCBIM37) and point mutations were called using
the GATK UnifiedGenotyper v2.7.2 as described above. Post-processing filters for
the intestinal ASCs of mouse 1 (analysed by SOLiD sequencing) were as follows: a
minimum depth of 10×, variant uniquely called in one intestinal stem cell without
more than one alternative allele found at the same position in the other ASCs of the
same mouse, a GATK phred-scaled quality score >100, variant absent in mouse 2,
variant position absent in dbSNP (build 128) and a VAF > 0.25. Post-processing
filters for the intestinal ASCs of mouse 2 (analysed by Illumina sequencing) were
as described above for the human mutation data.
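
The mouse 1 post-processing filters amount to a conjunction of simple per-variant checks, sketched below. The record layout and field names are hypothetical, the thresholds are read from the text as printed, and "no more than one alternative allele in the other ASCs" is interpreted here as at most one alternative-allele observation.

```python
from dataclasses import dataclass

# Thresholds as printed in the Methods for mouse 1 (SOLiD data).
MIN_DEPTH = 10        # minimum depth of 10x
MIN_QUALITY = 100.0   # GATK phred-scaled quality score > 100
MIN_VAF = 0.25        # variant allele frequency > 0.25


@dataclass(frozen=True)
class CandidateVariant:
    # Hypothetical record; field names are illustrative only.
    depth: int
    quality: float
    vaf: float
    called_in_n_ascs: int           # ASCs of this mouse with the call
    alt_alleles_in_other_ascs: int  # alt-allele observations elsewhere
    present_in_other_mouse: bool
    in_dbsnp: bool


def passes_mouse1_filters(v: CandidateVariant) -> bool:
    """Return True only if the variant clears every listed filter."""
    return (v.depth >= MIN_DEPTH
            and v.called_in_n_ascs == 1
            and v.alt_alleles_in_other_ascs <= 1
            and v.quality > MIN_QUALITY
            and not v.present_in_other_mouse
            and not v.in_dbsnp
            and v.vaf > MIN_VAF)
```

Because the checks are independent, they can be applied in any order; in practice the cheapest ones (depth, quality) would usually run first.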

Cancer-associated mutation spectra analysis in driver genes. Mutations
identified in the indicated genes in colorectal or liver cancers were downloaded from
cBioPortal (http://www.cbioportal.org/). Only point mutations that resulted in a
missense, nonsense or splice-site mutation were considered.

CNV detection. To detect copy-number variations (CNVs), BAM files were
analysed for read-depth variations by CNVnator v0.2.7 (ref. 39) with a bin size of 1 kb
and Control-FREEC v6.7 with a bin size of 5 kb. Highly variable regions, defined
as harbouring germline CNVs in at least three control samples, were excluded from
the analysis. To obtain somatic CNVs, we excluded CNVs for which there was
evidence in the reference sample (blood/biopsy) of the same individual. Resulting
candidate CNV regions were assessed for additional structural variants on the
paired-end and split-read level through DELLY v0.3.3 (ref. 41). Based on these
results, we excluded five candidate CNV regions as mapping artefacts on the
read-depth level and acquired base-pair accuracy of the involved breakpoints for the
other events. This also revealed the tandem orientation of the duplication events
and the complex structural variation in the colon sample.

Reported gene definitions (Extended Data Table 2) are based on Ensembl v75
(GRCh37; ref. 35). Common fragile sites overlapping the events were detected using
existing definitions. LINE/SINE elements within 100 bp of the breakpoints
were determined with the repeat element annotation from the UCSC genome
browser (ref. 34), GRCh37 (retrieved 26 October 2015).

Code availability. All code and filtered vcf files are freely available under an MIT
License at https://wgs11.op.umcutrecht.nl/mutational_patterns_ASCs/ and
https://github.com/CuppenResearch/MutationalPatterns/


27. Li, H. & Durbin, R. Fast and accurate short read alignment with
Burrows-Wheeler transform. Bioinformatics 25, 1754-1760 (2009).

28. Tarasov, A., Vilella, A. J., Cuppen, E., Nijman, I. J. & Prins, P. Sambamba:
fast processing of NGS alignment formats. Bioinformatics 31, 2032-2034
(2015).

29. DePristo, M. A. et al. A framework for variation discovery and genotyping using
next-generation DNA sequencing data. Nat. Genet. 43, 491-498 (2011).

30. Sherry, S. T. et al. dbSNP: the NCBI database of genetic variation. Nucleic Acids
Res. 29, 308-311 (2001).

31. R Core Team. R: A language and environment for statistical computing.
http://www.r-project.org/ (2015).

32. Pinheiro, J. et al. nlme: Linear and Nonlinear Mixed Effects Models.
https://cran.r-project.org/web/packages/nlme/nlme.pdf (2016).

33. ENCODE Consortium. An integrated encyclopedia of DNA elements in the
human genome. Nature 489, 57-74 (2012).

34. Rosenbloom, K. R. et al. The UCSC Genome Browser database: 2015 update.
Nucleic Acids Res. 43, D670-D681 (2015).

35. Cunningham, F. et al. Ensembl 2015. Nucleic Acids Res. 43, D662-D669
(2015).

36. Gaujoux, R. & Seoighe, C. A flexible R package for nonnegative matrix
factorization. BMC Bioinformatics 11, 367 (2010).

37. Lawrence, M. et al. Software for computing and annotating genomic ranges.
PLoS Comput. Biol. 9, e1003118 (2013).

38. Abyzov, A., Urban, A. E., Snyder, M. & Gerstein, M. CNVnator: an approach to
discover, genotype, and characterize typical and atypical CNVs from family and
population genome sequencing. Genome Res. 21, 974-984 (2011).

39. Boeva, V. et al. Control-FREEC: a tool for assessing copy number and allelic
content using next-generation sequencing data. Bioinformatics 28, 423-425
(2012).

40. Rausch, T. et al. DELLY: structural variant discovery by integrated paired-end
and split-read analysis. Bioinformatics 28, i333-i339 (2012).

41. Le Tallec, B. et al. Common fragile site profiling in epithelial and erythroid cells
reveals that most recurrent cancer deletions lie in fragile sites hosting large
genes. Cell Rep. 4, 420-428 (2013).

42. Jurka, J. Repbase update: a database and an electronic journal of repetitive
elements. Trends Genet. 16, 418-420 (2000).


[Extended Data Figure 1, panels: a, schematic (biopsy, organoid culture, single ASCs,
clonal organoid culture, WGS; blood as alternative reference sample); b, filter steps;
c, mutations passing filters; d, validations of mutations filtered out; e, validations
of mutations passing all filters; f, manual inspection of not confirmed positions.]

e, Validations of mutations passing all filters
ASC                      5a   5b   5c   6a   6b   6c   Total
Total positions tested   43   15   19   38   14   22   154
Confirmed mutations      37   11   18   34   10   21   134
Not confirmed             6    4    1    4    4    1    20

f, Manual inspection of not confirmed positions
ASC                      5a   5b   5c   6a   6b   6c   Total
True positive             1    1    1    1    2    0     6
False positive            3    3    0    2    1    1    10
Missed in germline        2    0    0    1    1    0     4

Extended Data Figure 1 | Cataloguing somatic mutation loads in
human ASCs. a, Schematic overview of the experimental setup to
determine somatic mutations in individual human ASCs. Colon, small
intestine and liver biopsies were cultured in bulk for 1-2 week(s) before
single cells were sorted and clonally expanded until enough DNA could be
isolated for WGS analysis. WGS of the clonal organoid culture allows
for cataloguing of somatic variants in the original ASCs that gave rise
to the clonal cultures that were acquired during life and the first 7-14 days
of culturing. Biopsy or blood was sequenced as a reference sample.
b, Filter steps to obtain somatic mutations in ASCs. c, Number of point
mutations that pass each corresponding filter step in b for each ASC
culture of donors 5 and 6. d, Independent validations of mutations that
were filtered out by amplicon-based resequencing. The asterisk indicates
a position that is not located in the surveyed areas of the assessed
ASCs in the original experiment, which is corrected for in all analyses.
e, Independent validations of mutations that passed all filters by
amplicon-based resequencing. Confirmed positions are defined as those with
a call in the indicated ASC with a VAF > 0.3 and without a call in the
corresponding reference sample. f, Qualification of unconfirmed positions
based on manual inspection. True-positive positions are positions that
were correctly called, but for which the VAF threshold was not met in
the validation experiment. False-positive positions are positions without
evidence in the validation experiment or are noisy. 'Missed in germline'
are positions that were called in the reference sample in the validation
experiment.

Extended Data Figure 2 | Variant allele frequency distribution plot
for each assessed ASC. A distribution plot of the VAFs of all somatic
mutations that remain before filtering for the VAF in filter step 6
(Extended Data Fig. 1b). Clonal heterozygous somatic mutations form
a peak around VAF = 0.5. A threshold of VAF > 0.3 was used to obtain
somatic mutations that were clonal in the organoid cultures and therefore
present in the original cloned ASCs (see Methods). Mutations acquired
after the single ASC expansion step are subclonal (that is, not present in all
cells of the clonal culture) and have lower VAFs. Two samples (donor 14,
ASC 14-b and donor 17, ASC 17-c) showed a shift in the main VAF peak to
the left, indicating that these cultures did not arise from a single ASC and
were therefore excluded from the study.

Extended Data Figure 3 | … mutations. a, Overlap of somatic point
mutations between the clonal organoid cultures and corresponding subcloned
cultures depicted in Extended Data Fig. 6. b, Confirmation rate of point
mutations, which were observed in the original cloned culture, in the
corresponding subcloned culture. Data are represented as the mean percentage
of confirmed point mutations over all clone-subclone pairs indicated in
a (n = 10) and error bars represent s.d.

Extended Data Figure 4 | Somatic mutation loads in consensus-surveyed
area and overlap of point mutations between ASCs from the
same donor. a, Correlation of the number of somatic point mutations per
ASC, which were observed in the genomic regions that were surveyed
(for example, a base coverage of at least 20× in both the clonal culture
and the reference sample; Methods) in all the ASCs, with the age of the
donors per tissue indicated. This consensus-surveyed area comprises
38.2% of the non-N autosomal genome. Each data point represents a
single ASC. Indicated are the P values of the age effects in the linear mixed
model (two-tailed t-test) for each tissue. The sample sizes for colon,
small intestine and liver are 6, 9 and 5 donors and 21, 14 and 10 ASCs,
respectively. b, Somatic mutation accumulation rate per tissue as estimated
… intervals of the slope estimates. c, Relative contribution of the indicated
mutation types to the point mutation spectra in the consensus-surveyed
area per tissue type. Data are represented as the mean relative contribution
of each mutation type over all ASCs per tissue type (n = 21, 14 and 10 for
colon, small intestine and liver, respectively); error bars represent s.d. The
total number of identified somatic point mutations per tissue is shown.
d, Overlap of the somatic point mutations between ASCs of the same
donor. The number of point mutations, observed in the total surveyed area
per ASC, that are shared between the assessed ASCs of the same donor is
indicated.

Extended Data Figure 5 | Point-mutation spectrum per donor. Relative
contribution of the different types of point mutation to the spectrum of
each donor. Data are represented as the mean relative contribution of each
mutation type when multiple ASCs were measured per donor (the number
of ASCs per donor is depicted for each donor) and error bars represent
standard deviation. Indicated are the age of the donors, the total number of
point mutations used to determine each spectrum and the tissue type.

Extended Data Figure 6 | Mutation patterns associated with long-term
in vitro expansion of ASCs. a, Schematic overview of the experimental
setup to catalogue mutations associated with the organoid culture system.
Clonal small intestinal and liver organoid cultures (Extended Data Fig. 1a)
were cultured for 3-5 months. A second clonal expansion step was
subsequently performed, followed by WGS analysis, to catalogue all the
mutations that were present in the subcloned ASCs. To obtain mutations
that were specifically acquired during culturing, mutations in the original
clonal cultures were subtracted from those observed in the corresponding
second subcloned cultures. b, Relative contribution of the indicated
mutation types to the point mutation spectra specifically observed in vitro
per tissue type. Data are represented as the mean relative contribution of
each mutation type over all subcloned ASCs per tissue type (n = 6 and
4 for small intestine and liver, respectively) and error bars represent s.d.
Indicated are the total number of identified somatic point mutations,
which were specifically acquired between the two clonal expansion
steps indicated in a, per tissue. c, Relative contribution of the mutational
signatures depicted in Fig. 2a, which explain the mutation spectra
depicted in b.

Extended Data Figure 7 | Non-random distribution of mutational
signatures throughout the genome. a, Context- and replication-timing-dependent
mutation spectrum of the three mutational signatures depicted
in Fig. 2a. Indicated is the contribution of each trinucleotide to the
signatures (order is similar to that in ref. 11), subdivided into the fraction of
the trinucleotide change present in early-, intermediate- or late-replicating
genomic regions. b, log2 ratio of the observed and expected number of
mutations per indicated base substitution (summed over all trinucleotides)
in early-, intermediate- and late-replicating genomic regions for each of
the signatures depicted in a. log2 ratio indicates the effect size of the bias
and asterisks indicate significant DNA-replication-timing bias (P < 0.05,
binomial test). c, log2 ratio of the total number of observed and expected
mutations in early-, intermediate- and late-replicating genomic regions for
each signature depicted in a. log2 ratio indicates the effect size of the bias
and asterisks indicate significant DNA-replication-timing bias (P < 0.05,
binomial test). d, Context- and transcriptional-strand-dependent
mutation spectrum of the three mutational signatures depicted in Fig. 2.
Indicated is the contribution of each trinucleotide to the signatures
(order is similar to that in ref. 11), subdivided into the fraction of the
trinucleotide change present on the transcribed and untranscribed
strand. e, log2 ratio of the number of mutations on the transcribed and
untranscribed strand per indicated base substitution for each signature
depicted in d. log2 ratio indicates the effect size of the bias and asterisks
indicate significant transcriptional strand bias (P < 0.05, binomial test).
f, The dN/dS ratio for all protein-coding somatic point mutations observed
in all ASCs per tissue type. Error bars indicate 95% confidence intervals
(likelihood ratio test).

Extended Data Figure 8 | Comparison of mutation loads between
intestinal ASCs derived from human and mouse. a, Mutation frequency
in mouse intestinal ASCs is compared to the linear fit, describing the
relationship between the mutation frequency in human intestinal ASCs
and age of the donor. Indicated by the dotted lines are the mean mutation
frequencies over all ASCs per mouse (n = 3) and the corresponding age of
human linear fit. b, Relative contribution of the indicated mutation types
to the point mutation spectra for all assessed human intestinal ASCs and
for each mouse. Data are represented as the mean relative contribution of
each mutation type over all the ASCs per indicated category (n = 14, 3 and
3 for human, mouse 1 and mouse 2, respectively); error bars indicate s.d.

Extended Data Table 1 | Overview of somatic point mutations detected in ASCs

ASC   Donor   Age (years)   Gender   Tissue   Surveyed genome (%)*   No. point mutations†
ta 8 Female Colon 03.8 458 
tb 4 8 Female Colon 12 400 
te 4 9 Female Colon 978 867 
2a 2 15 Male Colon 96.8 923 
2b 2 18 Male Colon 96.8 997 
ea 18 | 53 Male Colon 98.5 2,468 
eb 18 53 Male Colon 98.1 2,201 
a 19 | 56 Male Colon 977 2,655 
1b 19 56 Male Colon 974 2,384 
1c 19 | 56 Male Colon 98.2 3,383 
19d 19 | 56 Male Colon 978 2811 
4a 4 66 Female Colon 907 2,140 
4b 4 66 Female Colon 953 2,234 
4c 4 66 Female Colon 957 2,392 
4d 4 66 Female Colon 93.9 2,386 
ae 4 66 Female Colon 96.1 2,289 
ae 3 67 Female Colon 918 2,409 
ab 3 er Female Colon 18 2,882 
Be 3 67 Female Colon 919 2,561 
ad 3 67 Female Colon 920 2,667 
ae 3 er Female Colon 920 2,987 
ba 5 3 Female ‘Smallintestine 89.0 246 
bb 5 3 Female ‘Smallintestine 85.8 245 
Be 5 3 Female Smallintestine 88.5 304 
6a 6 8 Female ‘Smallintestine 97.1 591 
eb 6 8 Female Small intestine 93.5 506 
eco 6 8 Female Smallintestine 96.2 614 
Ta 7 44 Male ‘Small intestine 91.4 1,461 
Ba 8 45 Male Small intestine 95.5 1,838 
oa 9 45 Male Small intestine 93.9 1571 
1a 10 70 Female ‘Small intestine 948 2,497 
ita 1174 Male Small intestine 87.3 2613 
2a 12 78 Female --‘Smallintestine 95.6 3.516 
13a 13 87 Male Small intestine 97.7 3512 
13b 13 (BT Male Small intestine 97.0 1,987 
4a 14 30 Male Liver 813 ™ 
4b 14 30 Male Liver 85.2 age 
a 15 41 Female Liver 935 4,292 
1b 15 41 Female Liver 95.1 1,351 
a 17 48 Female Liver 79.4 1,495 
ib 17 46 Female Liver 737 4,273 
Bo 18 53 Male Liver 979 1,845 
18d 18 53 Male Liver 98.5 1,504 
ee 18 53 Male Liver 98.2 4577 
16a 16 55 Male Liver 975 1919 


*Percentage of the non-N autosomal genome with >20× coverage in both ASC culture and reference sample.
†Number of somatic point mutations detected within surveyed genome.

Extended Data Table 2 | Identified somatic structural variations in ASCs


Copy Number Variants 


Sample Tissue Chr Start Stop Size Type 'No. genes Fragile sile Microhomology Genes at breakpoint LINE/SINE 
WSC ha Liver 3 S4401,729 95 651,811 1,160,082 gain 5 5 Bop F TINCT 

ASC 4a Liver 3 111,726,406 113,471,637 1,745,231 gain 46 : 2bp TAGLNSATPVIA LIMCA|LIMS 
ASC 16-a Liver 9 80,763,759 141,213,431 90,449,672 gain a2 NA NA NA 

ASC 18-2 Liver 7 432,754,706 133,009,202 257,496 gain 2 : Obp CHCHD3IEXOC4 —MIRILIPAS 

ASC 18-4 Liver 5 9,125,105 59,718,364 593,259 loss 1 - ob PDEADIPDEAD —-LIPAS 

ASC 8a Smallintestine 5 «3.815.936 3,908,819 92,883 loss ° : 2bp + I 

ASC 11-2 Smallintestine 2 205,420,067 205,511,877 91,810 loss. 1 FRAZ 1 bp PARDSB)PARD3B  AluSx/LIME3B 
ASC 13-4 Smallintestine 11 63,974,352 66,222,668 2,248,316 loss 3 3bp FERMT3, “Lima 
ASC13+b Smallintestine 1 5,878,566 6,321,750 443,184 loss 13 FRAIA—1bp + THE1B) 

ASC 1-¢ Colon 3 60,700,662 61,199,328 498,666 oss 4 FRASB 1 bp FHITFHIT LAPASILIPAS 
ASC 3:2 Colon 130 115,169,878 115,169,878 gain 427 NA NA NA 

ASC 4-b&e Colon 14 102,805,595 104,172,376 1,966,781 loss 87 : NA NA NA 

ASC 4-b&e Colon 17 2429,169 2,672,747 143,578 —_ loss 5 : CTTGins —JPAFAHI81 ‘AlulolAUuSq 

ASC 4-b8e Colon 172,634,433 2,997,007 292,574 _loss 4 : NA NA NA 

Unbalanced Translocations 

Sample Tissue Chr (1) Position (1) Chr(2) Position (2) Type No. genes Fragile site Microhomalogy Genes at breakpoint LINE/SINE 

FSC BEE Colon Ta 102505 585 17 7,634,145 translocation NA 5 Top ZNFESOF F 

ASC 4-b&e Colon 14 104,172,376 18 18,518,987 translocation NA : Obp XRCC3}- ALR Alpha 

ASC 4-b&e Colon 172,927,007 18 18,518,987 translocation NA : Obp RAFIGAP2/__LIPABIALR Alpha


*No. genes, number of genes overlapping the event; fragile site, common fragile sites overlapping the event; microhomology, number of bases of microhomology observed at breakpoints; genes at
breakpoint, gene body affected by the breakpoint; LINE/SINE elements, observed elements within 100 bp of the breakpoint.

Formation of new chromatin domains determines
pathogenicity of genomic duplications


Martin Franke, Daniel M. Ibrahim, Guillaume Andrey, Wibke Schwarzer, Verena Heinrich,
Katerina Kraft, Rieke Kempfer, Ivana Jerkovic, Wing-Lee Chan, Malte Spielmann,
Bernd Timmermann, Lars Wittler, Ingo Kurth, Paola Cambiaso, Orsetta Zuffardi,
Gunnar Houge, Lindsay Lambie, Francesco Brancati, Ana Pombo,
Martin Vingron, Francois Spitz & Stefan Mundlos


Chromosome conformation capture methods have identified
subchromosomal structures of higher-order chromatin interactions
called topologically associated domains (TADs) that are separated
from each other by boundary regions. By subdividing the
genome into discrete regulatory units, TADs restrict the contacts
that enhancers establish with their target genes. However,
the mechanisms that underlie partitioning of the genome into
TADs remain poorly understood. Here we show by chromosome
conformation capture (capture Hi-C and 4C-seq methods) that
genomic duplications in patient cells and genetically modified
mice can result in the formation of new chromatin domains
(neo-TADs) and that this process determines their molecular
pathology. Duplications of non-coding DNA within the mouse
Sox9 TAD (intra-TAD) that cause female to male sex reversal in
humans showed increased contact of the duplicated regions
within the TAD, but no change in the overall TAD structure. In
contrast, overlapping duplications that extended over the next
boundary into the neighbouring TAD (inter-TAD) resulted in the
formation of a new chromatin domain (neo-TAD) that was isolated
from the rest of the genome. As a consequence of this insulation,
inter-TAD duplications had no phenotypic effect. However,
incorporation of the next flanking gene, Kcnj2, in the neo-TAD
resulted in ectopic contacts of Kcnj2 with the duplicated part of the
Sox9 regulatory region, consecutive misexpression of Kcnj2, and a
limb malformation phenotype. Our findings provide evidence that
TADs are genomic regulatory units with a high degree of internal
stability that can be sculptured by structural genomic variations.
This process is important for the interpretation of copy number
variations, as these variations are routinely detected in diagnostic
tests for genetic disease and cancer. This finding also has relevance
in an evolutionary setting because copy-number differences are
thought to have a crucial role in the evolution of genome complexity.

SOX9 is a developmental transcription factor with important
functions in chondrocyte differentiation and male sex determination.
The SOX9 locus is a genomic region that has been linked to various
human diseases with a broad range of phenotypes. We investigated
how genomic duplications in this region affect higher-order
chromatin organization and, in particular, the formation of TADs.
Hi-C, a technology to quantify chromatin contacts genome-wide,
shows a compartmentalization of the human locus in two major TADs,
one containing SOX9 (referred to hereafter as SOX9 TAD), the other
containing the two potassium channels KCNJ2 and KCNJ16 (referred to
hereafter as KCNJ TAD) (Fig. 1a). The large gene desert corresponding
to the SOX9 TAD has been shown to contain multiple regulatory
elements and human-disease-related sites (Fig. 1a). Duplications
including a region 0.5 megabases (Mb) upstream of SOX9 (referred
to as the RevSex region) lead to female-to-male sex reversal (Fig. 1a),
indicating that the RevSex region contains a critical regulator of SOX9
expression in the developing gonads. Surprisingly, duplications
that include the RevSex region but extend further upstream towards
the neighbouring genes KCNJ2 and KCNJ16 have no effect on sexual
development. Instead, they result in Cooks syndrome, a congenital
limb malformation characterized by aplasia of nails and short digits.
We identified a family (mother and daughter) with a third type of
duplication that includes the RevSex region and the entire gene desert
upstream of SOX9, but not the KCNJ2 and KCNJ16 genes (Fig. 1a). In
spite of complete overlap with the reported sex reversal duplications
and large parts of the Cooks syndrome duplication, carriers of this
variant are phenotypically normal. According to the Hi-C profile,
all sex-reversal-associated duplications are located within the SOX9
TAD, whereas the duplication with no phenotype and also the Cooks
syndrome duplications extend into the neighbouring KCNJ TAD,
thereby spanning two TADs and their boundary. We refer to these
two types of duplications as intra-TAD and inter-TAD, respectively.
To investigate the difference between the intra-TAD (sex reversal)
and the inter-TAD (no phenotype) duplications, we performed 4C-seq
of patient fibroblasts and compared them to controls. In control cells,
the SOX9 and the KCNJ2 viewpoint showed an interaction profile
restricted to their Hi-C predicted TADs (Fig. 1b). 4C-seq from a patient
with sex reversal and a 150 kilobase (kb) intra-TAD duplication showed
a slight increase with the duplicated part from SOX9, but an overall
unchanged profile from both viewpoints (Fig. 1c). Similarly, no change
in the configuration of the interaction profiles was observed with the
larger inter-TAD (no phenotype) duplication (Fig. 1d). To identify
the region specifically contacted by the duplicated part, we used the
unique sequence created by the duplication breakpoints to perform
allele-specific 4C-seq. In the intra-TAD duplication, we observed high
interaction with the entire SOX9 TAD (purple track, Fig. 1c). Similar
results were obtained with fibroblasts from a second individual with
a 470 kb intra-TAD duplication (Extended Data Fig. 1). By contrast,
the interaction profile generated from the inter-TAD duplication
breakpoint was restricted to the duplicated region, without contacting

1. Max Planck Institute for Molecular Genetics, RG Development & Disease, 14195 Berlin, Germany.
2. Institute for Medical and Human Genetics, Charité Universitätsmedizin Berlin, 13353 Berlin, Germany.
3. Berlin Institute of Health, 10117 Berlin, Germany.
4. Developmental Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany.
5. Max Planck Institute for Molecular Genetics, Department of Computational Molecular Biology, 14195 Berlin, Germany.
6. Max Planck Institute for Molecular Genetics, Sequencing Core Facility, 14195 Berlin, Germany.
7. Max Planck Institute for Molecular Genetics, Department Developmental Genetics, 14195 Berlin, Germany.
8. Institute of Human Genetics, Jena University Hospital, 07743 Jena, Germany.
9. Institute of Human Genetics, Uniklinik RWTH Aachen, 52074 Aachen, Germany.
10. Bambino Gesù Children's Hospital-IRCCS, 00165 Rome, Italy.
11. Department of Molecular Medicine, University of Pavia, 27100 Pavia, Italy.
12. Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, 5021 Bergen, Norway.
13. Division of Human Genetics, National Health Laboratory Service, University of the Witwatersrand, 2000 Johannesburg, South Africa.
14. Department of Life, Health and Environmental Sciences, University of L'Aquila, 67100 L'Aquila, Italy.
15. Istituto Dermopatico dell'Immacolata (IDI) IRCCS, 00167 Rome, Italy.
16. Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, 13125 Berlin-Buch, Germany.
17. Berlin-Brandenburg Center for Regenerative Therapies (BCRT), Charité Universitätsmedizin Berlin, 13353 Berlin, Germany.

*These authors contributed equally to this work.


13 OCTOBER 2016 | VOL 538 | NATURE | 265 


© 2016 Macmillan Publishers Limited, part of Springer Nature. All rights reserved.


LETTER 


[Figure 1 image: chr17 locus with duplication tracks labelled Cooks syndrome (inter-TAD), No phenotype (inter-TAD) and Sex reversal (intra-TAD), and a Control 4C-seq panel.]


Figure 1 | 4C-seq from human fibroblasts with intra-TAD and inter-TAD
duplications. a, SOX9 locus with Hi-C interaction profile (H1
embryonic stem cells). Schematic shows KCNJ TAD (blue), SOX9 TAD
(brown) and TAD boundaries (red hexagons). Ovals indicate regions
associated with sex reversal (RevSex) or Pierre Robin syndrome (PRS);
human duplications with corresponding phenotypes are indicated above
(line thickness indicates maximum/minimum size). b, 4C-seq from
control skin fibroblasts using SOX9 and KCNJ2 as viewpoints (triangles).
c, 4C-seq from fibroblasts of an individual with sex reversal and intra-TAD
duplication shows no change in overall interaction profile (below, log2
ratio to control). 4C-seq from duplication breakpoint (below, in purple)
shows increased interaction with the duplicated part and the rest of the
SOX9 TAD. d, 4C-seq from fibroblasts of individual with no phenotype
and inter-TAD duplication. Note unchanged interaction profiles from
SOX9 and KCNJ2 viewpoints. 4C-seq from duplication breakpoint
shows interactions restricted to the duplicated region. All reads mapped
to a wild-type genome (resulting in split viewpoint for duplication
breakpoints). See Methods for sample collection.


the SOX9, KCNJ2 or KCNJ16 genes (purple track, Fig. 1d). To further
investigate allele-specific interactions with the different viewpoints, 
we performed whole-genome sequencing of the patient sample with 
the inter-TAD duplication and determined if certain single nucleotide 
polymorphisms (SNPs) were contacted by one viewpoint and not 
the other. We identified several SNPs that were only contacted by
the duplication-specific viewpoint but not by the SOX9 or KCNJ2
viewpoints, suggesting that the duplicated region contacted primarily 
itself, thereby forming a separate domain (Extended Data Fig. 2 and 
Methods). 

To investigate the effect of these duplications on TAD structure in 
more detail, we re-engineered the human duplications in mice and 
performed capture Hi-C (cHi-C) and 4C-seq of the extended Sox9 
locus from wild-type and mutant limb buds at embryonic day 12.5 
(E12.5) (Fig. 2). We used CRISPR/Cas9 genome editing to create 
mice with a 400 kb intra-TAD duplication (Dup-S), equivalent to a
published patient with sex reversal. Heterozygous Dup-S mice were
phenotypically normal. This is probably explained by the finding that
a known mouse gonadal enhancer, TESCO, resides outside of the
duplication, and the human sex-reversal-associated regions could
so far not be linked to Sox9 regulation in mouse. The larger 1.6 Mb
inter-TAD duplication (Dup-L), equivalent to the individuals with no
phenotype, was generated using the Cre/loxP system and trans-allelic
recombination.

In agreement with the human data, our data show that the mouse
locus is subdivided into a Sox9 TAD, containing Sox9 as the only
gene, and a Kcnj TAD with Kcnj2 and Kcnj16, separated by a boundary
located in the gene desert (Fig. 2a). cHi-C from mice with the
intra-TAD duplication (Dup-S) showed no change in the overall TAD
structure (Fig. 2b). Subtraction of the wild-type profile from the Dup-S
profile demonstrated increased interaction of the duplicated part with
itself, but also with the rest of the Sox9 TAD including Sox9. By
contrast, cHi-C from the inter-TAD duplication (Dup-L) showed a new
interaction domain covering the duplicated parts of the Kcnj and Sox9
TADs (Fig. 2c). The subtraction profile showed a strong increase in
interaction in comparison to wild type, but only within the duplicated
region (thus excluding Sox9, Kcnj2 and Kcnj16) and no changes in the
adjacent TADs. This suggested that the new interaction domain was
isolated from the neighbouring Sox9 and Kcnj TADs. 4C-seq from
outside of the duplication (Sox9 and Kcnj2) confirmed the isolation of
this domain by showing wild-type interaction profiles and no ectopic
contacts (Extended Data Fig. 3). 4C-seq from the lacZ reporter located
at the duplication breakpoint corroborated our other findings, showing
interaction only with the duplicated region (Extended Data Fig. 3b).
In agreement with the observed isolation of the duplication domain,
Dup-L mice were phenotypically normal and fertile.

Thus, the two types of duplications differ fundamentally in their
effect on chromatin organization in spite of the fact that they overlap
to a large degree. The intra-TAD duplication (resulting in sex reversal)
has no effect on the overall TAD structure, whereas the larger
inter-TAD duplication (resulting in no phenotype in humans and mice)
resulted in the generation of a new chromatin domain that we refer
to as a 'neo-TAD'. The insulation of the neo-TAD prevents interaction
with the neighbouring Sox9, Kcnj2 or Kcnj16 genes and explains
the lack of pathology in humans and in mice that carry this type of
duplication.

The presence of the lacZ reporter in the middle of the Dup-L duplication
also allowed us to investigate the regulatory potential of the
neo-TAD by analysing the lacZ expression pattern in Dup-L mutants
and comparing it to lacZ reporters inserted in the Kcnj and Sox9 TADs
(Sleeping Beauty (SB) transposons carrying lacZ; SB-Kcnj and SB-Sox9
alleles; Fig. 2d and Extended Data Fig. 4). SB-Sox9 gave a strong signal
essentially recapitulating the endogenous Sox9 expression, whereas
SB-Kcnj showed a restricted signal in the developing jaw and nose. The
LacZ staining obtained in Dup-L was very similar to that of SB-Sox9,
suggesting that the regulatory sequence contained in the neo-TAD
was functional and sufficient to recapitulate most of the Sox9 expression
domains, but not that of Kcnj2. This was further supported by the
analysis of a deletion (Del-L allele, Extended Data Fig. 4) in which the
corresponding region had been removed by trans-allelic recombination.
LacZ staining in Del-L mice showed a complete loss of the Sox9, but
a preservation of the Kcnj2 expression domains (Fig. 2d). Thus, the
regulatory region in the neo-TAD was able to drive expression in a
tissue-specific Sox9 pattern. As predicted by reporter assays and histone
modification data, the region contains a large number of regulatory
sites (Extended Data Fig. 5a) that, as shown by the lacZ reporter in
Del-L mice, control the major part of Sox9 expression. Interestingly,
comparison of the interaction profiles of the Kcnj2 and Sox9 viewpoints
in wild type with the profile from the lacZ viewpoint in Dup-L revealed
a similar pattern and peak distribution of 4C-seq profiles, suggesting
that the newly formed domain folded in a structured manner retaining
its wild-type interaction characteristics (Extended Data Fig. 5).




[Figure 2 image: cHi-C maps and subtraction maps (relative to wild type) for the intra-TAD (Dup-S) and inter-TAD (Dup-L) duplications, with Kcnj16, Kcnj2 and Sox9 marked; LacZ panels SB-Sox9, SB-Dup-L, SB-Kcnj and SB-Del-L.]

Figure 2 | Capture Hi-C from intra-TAD and inter-TAD duplications
in mice. a, Capture Hi-C (cHi-C) of the Sox9 locus from wild-type E12.5
limb buds showing separation in two TADs. Schematic on the top right
depicts chromatin folding in Kcnj (blue) and Sox9 (brown) TAD. b, cHi-C
of intra-TAD duplication in Dup-S mice and subtraction map relative
to wild type (below) shows an increase in interaction frequency of the
duplicated region with itself (arrow) and the Sox9 TAD (asterisks). c, cHi-C
of inter-TAD duplication in Dup-L mice and subtraction map (below)
shows formation of a neo-TAD (brackets). Kcnj and Sox9 TAD indicated
by dashed lines. All coordinates chr11 (mm9). See Methods for sample
collection. d, Optical projection tomography (OPT) of LacZ staining
with insertion of lacZ reporter in Sox9 TAD (SB-Sox9), at the duplication
breakpoint (SB-Dup-L), in Kcnj TAD (SB-Kcnj), and in a deletion
(SB-Del-L) corresponding to Dup-L (see Extended Data Fig. 4). Note
recapitulation of Sox9 pattern in Dup-L and loss in Del-L. Scale bar, 1 mm.

We hypothesized that an extension of the duplication towards the
next flanking genes Kcnj2 and Kcnj16, as reported in human Cooks
syndrome, would result in incorporation of these genes in the
neo-TAD. We generated this inter-TAD duplication (Dup-C) by trans-allelic
recombination (Fig. 3). cHi-C of E12.5 Dup-C limb buds showed, like
in the Dup-L mutant, a new chromatin domain corresponding to the
duplicated region (Fig. 3a). The subtraction showed a strong increase
in interaction between the duplicated parts of the Sox9 and Kcnj TADs
including the Kcnj2 and Kcnj16 genes, but not with Sox9. Ectopic
interaction of Kcnj2 with parts of the Sox9 TAD was confirmed by
4C-seq from mouse limb buds and by analysing fibroblasts from a Cooks
syndrome patient (Extended Data Fig. 6). To investigate the effects
of these ectopic contacts on gene expression, we performed
whole-mount in situ hybridization and found Kcnj2 to be expressed in the
digit anlagen, with a pattern similar to Sox9 (Fig. 3b). RNA sequencing
(RNA-seq) expression analysis of Dup-C limb buds at E12.5 and E17.5
confirmed the upregulation of Kcnj2, whereas other genes around the
locus stayed unchanged, in particular Sox9, but also Kcnj16 (Fig. 3c and
Extended Data Fig. 7). Thus, the inclusion of Kcnj2 in the neo-TAD
resulted in its activation by regulatory elements that originally belonged
to the Sox9 TAD. In contrast to Kcnj2, Kcnj16 was not responsive to
ectopic activation. 4C-seq profiles in human fibroblasts with KCNJ2
and KCNJ16 as viewpoints showed slight differences in local intensity
but generally the same size of ectopic interaction (Extended Data
Fig. 6), indicating that a certain permissiveness or specificity must be
present for promoter activation in this setting.

Figure 3 | Capture Hi-C, gene expression and phenotype in inter-TAD
Cooks syndrome duplication. a, cHi-C and subtraction map relative to
wild type (below) of inter-TAD duplication in Dup-C E12.5 limb buds
shows formation of new interaction domain (neo-TAD, bracket)
including Kcnj2 and Kcnj16. Kcnj and Sox9 TAD indicated by dashed
lines. Coordinates chr11 (mm9). b, Whole mount in situ hybridization
for Sox9 and Kcnj2 at E12.5. Note Sox9-like expression of Kcnj2 in
Dup-C digits, n = 5. c, RNA-seq from E12.5 limb buds shows
upregulation of Kcnj2 in Dup-C mice only (Benjamini-Hochberg adjusted
P value, n = 2, see Methods). d, Phenotypic comparison of wild type and
the Dup-C mutant. Note (top to bottom) absence of claw, abnormal
shape and positioning of phalanges (micro-CT), hypoplasia of nail bed
and abnormal distal phalanx (Masson-Goldner). Enlargement (dashed
boxes) showing joint fusion in Dup-C mutants. Phenotype is fully
penetrant (n = 30).

These new contacts and the associated misexpression of Kcnj2 were
accompanied by major phenotypic changes. Heterozygous Dup-C mice
showed a limb malformation phenotype at birth closely resembling
Cooks syndrome, highlighted by the absence or severe hypoplasia
of all claws or nails (Fig. 3d). Micro-computed tomography (CT)
and histology demonstrated an abnormal shape and size of the distal
phalanges, which were fixed in a straight position due to a malformed
terminal phalanx and a partly fused interphalangeal joint. To rule out
that the Cooks phenotype was merely produced by increased gene
dosage of Kcnj2, we created intra-TAD duplications that included Kcnj2
and Kcnj16 (Dup-K1 and Dup-K2) (Extended Data Fig. 4). These mice
were normal and had no digit phenotype. In addition, we created a
second Cooks allele (Dup-C2) that included Kcnj2 in the duplication, but
not Kcnj16. These mice showed the typical Cooks syndrome phenotype.
Taken together, our data suggest misexpression of Kcnj2 as the cause of
Cooks syndrome. The Drosophila melanogaster KCNJ2 homologue Irk2,
an inwardly rectifying K+ channel, has been shown to have additional
functions in development via the dpp (bone morphogenic protein,
BMP) pathway. Mutations in components of the BMP pathway are a
major cause of abnormalities in digits and joints, providing a possible
connection to the observed pathology in Cooks syndrome.

The Dup-C duplication resulted in the inclusion of Kcnj2 in the
neo-TAD and its positioning next to the Sox9 regulatory domain without
an intervening boundary. We hypothesized that a similar effect might
be achieved by removing the boundary between the Sox9 and Kcnj
TADs. To test this hypothesis, we used CRISPR/Cas9 to delete the
predicted boundary region (Bor), a small (18 kb) region containing
conserved CTCF binding sites. Mice with this deletion (ΔBor) had no
apparent phenotype. cHi-C analyses of homozygous ΔBor limb buds
showed an increase of interaction but no fusion of the Kcnj and Sox9
TADs (Extended Data Fig. 8a). This ectopic interaction resulted in the
upregulation, but no site-specific misexpression, of Kcnj2 (Extended
Data Fig. 8b). Thus, deletion of the boundary resulted in ectopic
contacts, as previously reported by others in vitro, but the overall
TAD structure remained unchanged. Similar results were obtained
when deleting the boundaries in the Dup-L duplication (Extended
Data Fig. 8). We observed increased interaction between the TADs,
but overall the neo-TAD remained stable, indicating that the
neo-TAD behaved like a 'regular' TAD. The importance of boundaries
in restricting chromatin interactions was highlighted previously by
deletions at the Epha4 locus. However, in these experiments large
portions of the adjacent TADs were deleted together with the boundaries,
thereby disrupting the overall TAD structure. Our present data indicate
that other factors, such as additional CTCF sites and loops within
TADs, contribute to TAD stability. The deletion of a boundary alone
therefore has no major consequences, whereas larger deletions result in
a re-organization of the locus enabling new contacts.

Figure 4 | Duplication-induced effects on chromatin organization and
phenotype. Intra-TAD duplications (blue) do not change the overall TAD
conformation but can result in abnormal gene regulation (sex reversal).
The centromeric and telomeric parts of the tandem duplications are
indicated. Inter-TAD duplications crossing TAD boundaries (green and
yellow) result in the formation of new chromatin domains (neo-TADs).
Insulation from their neighbours results in neutralization of regulatory
effects and normal phenotype. Incorporation of genes in the neo-TAD
provides the duplicated gene with a novel regulatory landscape and can
result in gene misexpression (Cooks syndrome).

Duplications are generally thought to confer their phenotypic effect
through an increase in gene dosage, but often the observed phenotype
cannot be explained by alterations in gene dosage. Our data show
how duplications can have different effects on higher-order chromatin
structure, depending on their size and position. Duplications that are
confined to a TAD (intra-TAD) have no major effect on TAD structure
but can result in increased interaction of duplicated regulatory elements
with their target gene. In contrast, duplications that cover parts of two
TADs and their boundary (inter-TAD) result in the formation of a
new TAD that is insulated from its neighbours. We propose that these
newly created domains should be called 'neo-TADs'. Genes that become
incorporated in a neo-TAD can be activated by its regulatory elements,
thereby eliciting pathogenic effects. Our findings also demonstrate that
the genomic effects of structural variations cannot solely be explained
by the rewiring of enhancer-promoter contacts. Rather, our data show
that TADs are robust and stable genomic units that can be rearranged and
recombined to create new regulatory regions of the genome. The integrity
of these units in relation to neighbouring TADs and genes determines
their gene regulatory and thus pathogenic effect. Figure 4 shows a
schematic of the proposed disease mechanism associated with TAD changes.
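
The distinction drawn above between intra-TAD and inter-TAD duplications can be illustrated with a minimal sketch. It assumes a toy locus with two TADs and invented coordinates (the `Interval` class, the `classify_duplication` function and all numbers are hypothetical and are not taken from the paper); it only encodes the rule that a duplication overlapping two TADs spans their boundary, and reports which genes lie entirely inside it.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Interval:
    """Half-open genomic interval [start, end) in arbitrary toy units."""
    start: int
    end: int

    def overlaps(self, other: "Interval") -> bool:
        return self.start < other.end and other.start < self.end

    def contains(self, other: "Interval") -> bool:
        return self.start <= other.start and other.end <= self.end


def classify_duplication(
    dup: Interval, tads: List[Interval], genes: Dict[str, Interval]
) -> Tuple[str, List[str]]:
    """Return (class, genes fully inside the duplication).

    A duplication overlapping a single TAD is called intra-TAD. One that
    overlaps two or more TADs, and therefore spans a boundary, is called
    inter-TAD; genes lying entirely inside it are reported as candidates
    for incorporation into the neo-TAD.
    """
    touched = [tad for tad in tads if tad.overlaps(dup)]
    if len(touched) <= 1:
        return "intra-TAD", []
    inside = sorted(name for name, gene in genes.items() if dup.contains(gene))
    return "inter-TAD", inside


# Hypothetical layout (not real coordinates): Kcnj TAD followed by Sox9 TAD.
TADS = [Interval(0, 1000), Interval(1000, 3000)]
GENES = {
    "Kcnj16": Interval(100, 150),
    "Kcnj2": Interval(300, 350),
    "Sox9": Interval(2900, 2950),
}
```

In this toy layout, a duplication such as `Interval(500, 2100)` is inter-TAD with no gene inside (the Dup-L situation), whereas `Interval(250, 2100)` is inter-TAD and contains Kcnj2 (the Dup-C situation). The sketch says nothing about expression; it only flags which configuration a given duplication falls into.
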

The concept presented here provides a framework to predict the
phenotypic outcome of genomic variations that can be directly applied for
the interpretation of copy number variations (CNVs) detected in
diagnostic screens, routinely performed in patients with congenital
malformations and/or intellectual disability, or for structural variations
found in cancer. Furthermore, the effects of genomic rearrangements
described here are probably important for evolutionary mechanisms,
as duplications are thought to be a major driver in evolution. The
process of gene neofunctionalization is thought to work through gene
duplications and subsequent adaptation of one of the gene copies. Our
data suggest a further mechanism in which the isolation of a newly
formed TAD can result in a phenotypic change in the organism that is
then directly subject to selective pressure, without affecting the parent
copy of the gene. With variable shifting of TADs and recombination of
regulatory activity with new target genes, an entire toolbox of
possibilities for new gene functions can be acquired.


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.

METHODS

No statistical methods were used to predetermine sample size. There was no 
randomization of experiments, and investigators were not blinded during 
experiments and outcome assessment. 

ES cell targeting and transgenic mouse strains. Embryonic stem (ES) cell
culture was performed as described previously. ES and feeder cells were tested
for mycoplasma contamination using Mycoalert detection kit (Lonza, catalogue
number LT07-118) and Mycoalert assay control set (Lonza, catalogue number
LT07-518). A single LoxP site upstream of Kcnj16 (mm9 chr11:110,772,110) was
targeted in G4 ES cells (129/Sv × C57BL/6 F1 hybrid) using CRISPR/Cas9 and a
single-stranded donor oligonucleotide (ssODN). ES cells were cultured on MEF
feeder cells under standard conditions and then cotransfected (FuGene HD,
Promega) with 8 µg of pX459 vector (Addgene) carrying the single guide RNA
(cenKcnj) and 500 pM of a ssODN (40 bp homologous sequence flanking a LoxP
site); sequences are given in Supplementary Table 1. To delete the 18 kb TAD boundary
region (Bor) in G4 ES cells and ES cells from Dup-L homozygous mice, two
sgRNAs (cenΔBor, telΔBor) were cloned in pX459 vectors and cotransfected. The
intra-TAD duplication was generated in G4 ES cells using two sgRNAs (cenDupS,
telDupS). Embryos and live animals from ES cells were generated by diploid or
tetraploid complementation. Genotyping was performed by PCR analysis.

The SB-Kcnj and SB-Sox9 alleles contain an insertion of a Sleeping Beauty (SB)
transgene. This transgene harbours a single LoxP site and a lacZ reporter gene,
flanked by transposons. Both alleles were targeted using standard protocols for
homologous recombination in E14 ES cells. Primer sequences for amplifying
homology sequences are provided in Supplementary Table 1. Positive ES cell clones
were injected into donor blastocysts to generate chimaeras. The SB-Kcnj allele was
further used for remobilization of the SB transgene, following the protocol in ref. 22,
to generate new SB insertion sites at the locus. Duplications and corresponding
deletions were generated as described previously. All generated duplications/
deletions and donor mouse strains are listed in Extended Data Fig. 4.

Mouse strains were maintained by crossing them with C57BL/6J mice. All
animal procedures were conducted as approved by the local authorities (LAGeSo
Berlin) under the license numbers G0368/08 and G0247/13.

ES cell generation of Dup-L mice. ES cells from Dup-L homozygous blastocysts
were established using N2B27 medium supplemented with FGF/Erk and Gsk3
pathway inhibitors (2i) and LIF according to ref. 24. An established ES cell line was
used for subsequent boundary deletion experiments using CRISPR/Cas9.

4C-seq. 4C-seq libraries were generated from microdissected mouse tissues or
human fibroblasts as described previously. The starting material for all 4C-seq
libraries was 5 × 10⁶ to 1 × 10⁷ cells, which corresponds to limb buds from a pool
of eight E12.5 embryos. All 4C-seq experiments were carried out in heterozygous
animals. 4-bp cutters were used as primary and secondary restriction enzymes
(Supplementary Table 2). For each viewpoint, a total of 1 to 1.6 µg DNA was
amplified by PCR (Supplementary Table 2). All samples were sequenced with Illumina
Hi-Seq technology according to standard protocols. 4C-seq experiments from all
viewpoints were carried out in biological replicates in wild type and Dup-L mutants
and tested for reproducibility (Pearson R for WT: Sox9 = 0.87, Kcnj2 = 0.90;
Dup-L: Sox9 = 0.94, Kcnj2 = 0.94, lacZ = 0.89). A representative result is shown in
the figures. Experiments from patient samples and Dup-C mutants were performed
as singletons.

Bioinformatics. For 4C-seq data analysis, reads were pre-processed, mapped
to a corresponding reference (GRCh37/hg19 or NCBI37/mm9) using BWA
and coverage normalized as reported previously. The viewpoint and adjacent
fragments 1.5 kb upstream and downstream were removed and a window of 10
fragments was chosen to normalize the data per million mapped reads (RPM).
To compare interaction profiles of different samples, we obtained the log2-fold
change for each window of normalized reads. To obtain ratios, duplicated regions
were excluded for calculation of the scaling parameter used in RPM normalization.
Code is available upon request.
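
The windowed normalization and log2 comparison described above can be sketched as follows. This is an illustrative reading, not the authors' code (which is stated to be available on request): the function names, the sliding-window convention and the pseudocount are assumptions, and the excluded region is given as a half-open range of fragment indices.

```python
import math
from typing import List, Optional, Sequence, Tuple


def windowed_rpm(
    counts: Sequence[float],
    window: int = 10,
    exclude: Optional[Tuple[int, int]] = None,
) -> List[float]:
    """Sum reads in sliding windows of `window` fragments and scale to RPM.

    The scaling total ignores fragments with index in [exclude[0], exclude[1]),
    mirroring the exclusion of duplicated regions from the scaling parameter.
    """
    if exclude is None:
        total = sum(counts)
    else:
        lo, hi = exclude
        total = sum(c for i, c in enumerate(counts) if not lo <= i < hi)
    if total <= 0:
        raise ValueError("no reads outside the excluded region")
    factor = 1e6 / total
    return [
        sum(counts[i:i + window]) * factor
        for i in range(len(counts) - window + 1)
    ]


def log2_ratio(
    sample: Sequence[float], control: Sequence[float], pseudocount: float = 1.0
) -> List[float]:
    """Per-window log2 fold change; the pseudocount (an assumption) avoids log(0)."""
    return [
        math.log2((s + pseudocount) / (c + pseudocount))
        for s, c in zip(sample, control)
    ]
```

Excluding the duplicated fragments from the scaling total keeps a copy-number gain from depressing the normalized signal everywhere else, so that a doubled region shows up as a log2 ratio near 1 while the flanks stay near 0.
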

SNP analysis of 4C-seq data. To indicate selective interactions in a patient with
an inter-TAD duplication (no phenotype), raw reads of 4C-seq experiments with
viewpoints in SOX9, KCNJ2 and in the duplication breakpoint were mapped using
BWA without pre-processing and variants were called for all samples together
using GATK (v3.4-46) for the duplicated region (chr17:68195430-69981335,
hg19). Bi-allelic SNP positions with a minimum coverage of 10 reads were
selected. These were called homozygous for one allele contacted by the breakpoint
viewpoint and homozygous for the other allele contacted by either the SOX9 or
KCNJ2 viewpoint. Additionally, whole-genome sequencing was performed and
alignment and variant calling was done in the same way. Allele frequencies of
the WGS experiment were then compared to the allele frequencies of the 4C-seq
experiments at the selected SNP positions.
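
The SNP selection rule can be written down as a small filter. The sketch below assumes genotype calls are already available per position and viewpoint; the `Call` record, the viewpoint keys and the requirement that both compared viewpoints reach the minimum depth are illustrative assumptions rather than details given in the text.

```python
from typing import Dict, List, NamedTuple


class Call(NamedTuple):
    genotype: str  # "0/0", "0/1" or "1/1" for a bi-allelic SNP
    depth: int     # read coverage at the position


HOMOZYGOUS = {"0/0", "1/1"}


def is_discordant(breakpoint: Call, other: Call, min_depth: int = 10) -> bool:
    """True if both calls are covered and homozygous for opposite alleles."""
    if breakpoint.depth < min_depth or other.depth < min_depth:
        return False
    return (
        breakpoint.genotype in HOMOZYGOUS
        and other.genotype in HOMOZYGOUS
        and breakpoint.genotype != other.genotype
    )


def select_snps(
    calls: Dict[int, Dict[str, Call]], min_depth: int = 10
) -> List[int]:
    """Return positions where the breakpoint viewpoint contacts one allele
    and the SOX9 or KCNJ2 viewpoint contacts the other."""
    selected = []
    for position in sorted(calls):
        by_viewpoint = calls[position]
        bp = by_viewpoint.get("breakpoint")
        if bp is None:
            continue
        if any(
            is_discordant(bp, by_viewpoint[name], min_depth)
            for name in ("SOX9", "KCNJ2")
            if name in by_viewpoint
        ):
            selected.append(position)
    return selected
```

Positions passing this filter are the ones whose allele frequencies would then be compared against the whole-genome sequencing data.
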

SureSelect design. The library of SureSelect enrichment probes was designed over
the genomic interval (mm9, chr11:109010000-114878000) using the SureDesign
online tool of Agilent. Probes cover the entire genomic region and were
not designed specifically in proximity of DpnII sites. The probes covered 88% of
the interval.

Capture Hi-C. cHi-C libraries were prepared from homozygous E12.5 limb buds
(except Dup-S) as described previously. In summary, 5 × 10⁶ to 1 × 10⁷ cells were
used for crosslinking, cell lysis, DpnII digestion, ligation and de-crosslinking.
Re-ligated products were then sheared using a Covaris sonicator (duty cycle: 10%,
intensity: 5, cycles per burst: 200, time: 6 cycles of 60 s each, set mode: frequency
sweeping, temperature: 4 to 7°C). Adaptors were added to the sheared DNA
and amplified according to Agilent instructions for Illumina sequencing. The
library was hybridized to the custom-designed sure-select beads and indexed
for sequencing (50 to 100 bp paired-end) following Agilent instructions. Capture
Hi-C experiments were performed as singletons. As an internal control, we
compared the results from six experiments for regions outside of the region of
interest (chr11:109,010,001-110,250,000 and chr11:113,100,001-114,870,000).
The cHi-C maps of the internal control were highly correlated between the six
samples (Spearman R: WT/Dup-L = 0.96; WT/Dup-C = 0.94; WT/Dup-S = 0.96;
WT/ΔBor = 0.96; WT/Dup-LΔBor = 0.96; Dup-L/Dup-C = …;
Dup-L/Dup-S = 0.96; Dup-L/ΔBor = 0.96; Dup-L/Dup-LΔBor = …;
Dup-C/Dup-S = 0.94; Dup-C/ΔBor = 0.94; Dup-C/Dup-LΔBor = …;
Dup-S/ΔBor = 0.96; Dup-S/Dup-LΔBor = 0.96; ΔBor/Dup-LΔBor = …),
confirming the high reproducibility of the methodology.
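
A rank correlation of this kind can be computed as sketched below, assuming each cHi-C map is a square matrix of bin counts and the internal control is given as a list of bin indices outside the region of interest. The helper names are hypothetical, and the implementation (Pearson correlation of average ranks) is a generic Spearman coefficient, not necessarily the routine used for the values above.

```python
import math
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks, with tied values sharing the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation as the Pearson correlation of average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def control_values(matrix: Sequence[Sequence[float]], bins: Sequence[int]) -> List[float]:
    """Flatten the upper triangle restricted to the internal-control bins."""
    return [matrix[i][j] for a, i in enumerate(bins) for j in bins[a:]]
```

Two maps would then be compared with `spearman(control_values(map_a, bins), control_values(map_b, bins))`.
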
Bioinformatics. Preprocessing and mapping of paired-end sequencing data, as
well as filtering of mapped di-tags, was performed with the HiCUP pipeline v0.5.8
(ref. 28). The pipeline used Bowtie2 v.2.2.6 (ref. 29) for mapping short reads to
the reference genome (NCBI37/mm9). Filtered di-tags were further processed with
Juicebox command line tools to bin di-tags (10 kb bins) and to normalize the map
by KR normalization. For this, only reads with a MAPQ > 30 were considered. The
DNA-capturing step enriches genomic region chr11:109,010,001-114,878,000 on
mm9, leading to three different regimes in the cHi-C map: (i) enriched versus
enriched, (ii) enriched versus non-enriched, and (iii) non-enriched versus
non-enriched. For binning and normalization only di-tags in regime (i) were
considered. Therefore di-tags were filtered for the enriched region and mm9
coordinates were shifted by -109,010,000 bp. For Juicebox a custom chromosome sizes
file containing only the enriched region on chr11 (length 5,868,000 bp) was used.
After binning and normalization, coordinates were shifted back to their original
values. However, in general duplicated regions yield more signal compared to
non-duplicated regions when mapped to the wild-type reference genome. The signal
is flattened disproportionately in duplicated regions by a normalization procedure
that balances the whole interaction matrix, such as KR normalization. Therefore,
we used only raw count maps to calculate the differences between samples.
Difference maps were generated based on raw count maps, scaled individually by
dividing each value of the matrix by a factor (sum 'masked' triangle matrix/10^6).
To avoid copy number biases, the region spanning all tested duplications
(chr11:110,770,001-112,520,000) was not considered for the computation of the scaling
factor. cHi-C maps of count values and difference maps were visualized with the
WashU epigenome browser.
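
The coordinate shift and the masked scaling used for the difference maps can be sketched as follows. The offset and bin size are the values given above; everything else is an assumption made for illustration, in particular the function names, the default divisor `per`, and the choice to skip every bin pair that touches the masked region when summing the upper triangle.

```python
from typing import List, Tuple

Matrix = List[List[float]]

CAPTURE_OFFSET = 109_010_000  # shift applied to mm9 chr11 coordinates
BIN_SIZE = 10_000             # 10 kb bins


def bin_index(position: int) -> int:
    """Map a 1-based mm9 chr11 position to a 0-based bin of the captured region."""
    return (position - CAPTURE_OFFSET - 1) // BIN_SIZE


def masked_sum(matrix: Matrix, mask: Tuple[int, int]) -> float:
    """Sum the upper triangle, skipping pairs touching bins in [mask[0], mask[1])."""
    lo, hi = mask
    total = 0.0
    for i, row in enumerate(matrix):
        for j in range(i, len(row)):
            if lo <= i < hi or lo <= j < hi:
                continue
            total += row[j]
    return total


def scale(matrix: Matrix, mask: Tuple[int, int], per: float = 1e6) -> Matrix:
    """Divide every value by (masked upper-triangle sum / per)."""
    factor = masked_sum(matrix, mask) / per
    if factor == 0:
        raise ValueError("no counts outside the masked region")
    return [[value / factor for value in row] for row in matrix]


def difference_map(
    mutant: Matrix, wild_type: Matrix, mask: Tuple[int, int], per: float = 1e6
) -> Matrix:
    """Scaled mutant map minus scaled wild-type map, both built from raw counts."""
    m = scale(mutant, mask, per)
    w = scale(wild_type, mask, per)
    return [[a - b for a, b in zip(rm, rw)] for rm, rw in zip(m, w)]
```

With these conventions the masked region chr11:110,770,001-112,520,000 corresponds to bins 176 to 350, so a call would pass `mask=(176, 351)`.
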

RNA-seq. E12.5 distal limbs were microdissected from wild-type and mutant
embryos (n = 2) and immediately frozen in liquid nitrogen. From these, RNA was
isolated using TRIzol extraction and the RNeasy Mini Kit (QIAGEN). Samples
were poly-A enriched and sequenced (paired-end 50 bp) using Illumina technology
following standard protocols.

Bioinformatics: 50 bp paired-end reads were mapped to the mouse reference
genome (mm9) using the STAR mapper (splice junctions based on RefSeq;
options: --alignIntronMin 20 --alignIntronMax 500000 --outFilterMultimapNmax
5 --outFilterMismatchNmax 10 --outFilterMismatchNoverLmax 0.1). Reads per
gene were counted as described previously, and used for differential expression
analysis with the DESeq2 package.
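
Figure 3c reports Benjamini-Hochberg adjusted P values for this differential expression analysis. For readers unfamiliar with the adjustment, the following is a generic sketch of the Benjamini-Hochberg step-up procedure; it is a textbook implementation written for illustration (the function name is hypothetical) and is not extracted from DESeq2 or from the authors' pipeline.

```python
from typing import List, Sequence


def benjamini_hochberg(pvalues: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted P values in the input order.

    Each P value is multiplied by m / rank (m tests, rank 1 = smallest),
    then a running minimum from the largest rank downwards enforces
    monotonicity; results are capped at 1.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, pvalues[index] * m / rank)
        adjusted[index] = running_min
    return adjusted
```

A gene is then called differentially expressed when its adjusted value falls below the chosen false discovery rate.
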
Phenotypic analysis. Phenotypic analysis for mutant mouse lines was carried
out for at least three animals per analysis and developmental stage. Phenotypic
analysis was carried out in heterozygous animals, for ΔBor in heterozygous and
homozygous animals, and for Dup-LΔBor in homozygous animals. Penetrance of
the phenotypes was investigated by analysing >30 offspring and considered fully
penetrant if all animals were similarly affected.

Micro-computer tomography. Autopods of seven-week-old control and mutant
mice (n = 3) were scanned using a Skyscan 1172 X-ray microtomography
system (Bruker microCT, Belgium) at 5 µm resolution. 3D model reconstruction
was done with the Skyscan image analysis software CT-Analyser and CT-volume
(Bruker microCT, Belgium).

Whole-mount in situ hybridization and histology. Wild-type and mutant
E12.5 embryos (n = 5) were subjected to whole mount in situ hybridization
using standard procedures. Sox9 and Kcnj2 probes were generated by PCR
amplification using mouse limb bud cDNA (Supplementary Table 1).
For Masson-Goldner staining, limbs of newborns (P0, n = 2) were first fixed in
4% PFA in PBS containing 0.5 M EDTA and then embedded in paraffin. Paraffin
sections of 6 µm were then stained with the Merck Masson-Goldner staining kit
(catalogue number 100485) and Meyers haematoxylin (catalogue number MHS80)
according to the manufacturer's instructions.

LacZ staining and optical projection tomography. E12.5 mouse embryos (n = 3)
were dissected in cold PBS, fixed in 4% PFA in PBS on ice for 30 min, washed twice
with ice-cold PBS and once at room temperature (19-24°C), and then stained
overnight for β-galactosidase activity in a humid chamber at 37°C as previously
described. After staining, embryos were washed in PBS and stored at 4°C in 4%
PFA in PBS.

For OPT scanning, stained embryos were embedded in 1% low-melt agarose
and dehydrated over 1-2 days using methanol (2-3 methanol steps). Subsequently,
samples were cleared overnight in BABB (1 part benzyl alcohol, 2 parts benzyl
benzoate). The samples were then scanned (802 frames) with a Bioptonics OPT
3001M scanner with white light (exposure = 10-100 ms) for LacZ staining
and using a GFP filter combination (exciter 425 nm/40 nm, emitter LP475 nm) and
20-380 ms exposure for autofluorescence. 3D reconstructions were generated using
Skyscan software and further analysed with Amira and Imaris software.

Human material. Skin biopsies from one sex reversal patient, the no phenotype
individual, one Cooks syndrome patient and controls, as well as gonadal
(testicular) fibroblasts from one sex reversal individual were obtained by
standard procedures. Fibroblasts were cultured in DMEM (Lonza) supplemented
with 10% fetal calf serum (Gibco), 1% L-glutamine (Lonza) and 1% penicillin/
streptomycin (Lonza). Written informed consent was obtained from all individuals
who participated in this study. This study was approved by the Charité
Universitätsmedizin Berlin ethics committee.

Extended Data Figure 1 | 4C-seq from patient fibroblasts with a 470 kb
intra-TAD duplication (sex reversal). Schematic representation depicts
TAD structure with KCNJ TAD (blue), SOX9 TAD (brown) and TAD
boundaries (red hexagons). Size and position of the 470 kb intra-TAD
duplication in a patient with female-to-male sex reversal is indicated by
the overlap. Patient fibroblasts were derived from a gonadal biopsy of
ovo-testes. 4C-seq tracks below show interaction profiles from SOX9
(brown), KCNJ2 (blue) and breakpoint (purple) viewpoints. Note
interaction profiles from SOX9 and KCNJ2 are restricted to their
corresponding TADs. Unique viewpoint at the breakpoint shows that
interactions are restricted to the SOX9 TAD. Note all reads mapped
to a wild-type genome resulting in split viewpoint for the duplication
breakpoint.

Extended Data Figure 2 | Allele frequencies determined in 4C-seq
data indicate selective interactions in a patient with an inter-TAD
duplication (no phenotype). SNP analysis related to Fig. 1d.
a, Schematic of SNP-analysis in a patient carrying an inter-TAD
duplication (heterozygous) with no phenotype. SNP positions and
their allele frequency (AF) identified by whole-genome sequencing
from the patient are shown. Bottom, 4C-seq interaction profiles using
SOX9 (brown), KCNJ2 (blue) and the duplication breakpoint (purple)
as viewpoints (triangles). SNPs contacted by individual viewpoints are
indicated. Note that variants contacted by the breakpoint viewpoint are
not contacted by SOX9 and KCNJ2 viewpoints, suggesting the formation
of an insulated interaction domain (purple). b, Summary of observed SNP
genotype and allele frequency from whole-genome sequencing data and
4C-seq viewpoints.

Extended Data Figure 4 | Overview of mouse alleles used in this study. The extent of each duplicated and deleted region in mice is given as mm9
coordinates. Observed mouse phenotypes and human syndromes with the equivalent mutation are listed. For generation of alleles see Methods.

Extended Data Figure 5 | 4C-seq interaction profile of neo-TAD in
a, The position of published in vivo tested reporter constructs and
active chromatin marks (H3K4me1/H3K27ac positive) in Sox9-expressing
tissue (mouse ENCODE project) are indicated above the 4C tracks.
The only known enhancer driving testis-specific expression, TESCO, is
indicated. 4C-seq interaction profiles from Kcnj2 (blue) and Sox9 (brown)
in wild type and from the lacZ reporter gene (purple) in E12.5 Dup-L
mutants are shown. Triangles indicate viewpoints. Note that 4C-seq reads
are mapped to the wild-type genome resulting in split viewpoint from
the lacZ viewpoint. The interaction profile from lacZ viewpoint located
at the breakpoint of the Dup-L duplication shows a similar pattern and
peak distribution (dashed boxes) as the Sox9 and Kcnj2 viewpoints. Note
similarities of peak profiles at the boundary between Kcnj and Sox9 TADs
(arrows in red dashed box). b, Schematic of locus with an artificially
duplicated region and corresponding 4C-seq profile from the lacZ
viewpoint inside the neo-TAD in Dup-L mutants. The centromeric and
telomeric part of the tandem duplication is indicated. The purple 4C-seq
profile corresponds to the neo-TAD delimited by the duplicated TAD
boundary.

Extended Data Figure 7 | RNA-seq analysis of genes at the Sox9, Kcnj2
and Kcnj16 locus from mouse mutants used in this study. a, b, RNA-seq
from wild-type and mutant E12.5 limb buds and E17.5 fingertips
(lower two tracks) shows read profiles for Kcnj16, Kcnj2 and Sox9. Note
the absence of expression of Kcnj16 in the examined tissue. c, Summary
of expression values of genes at the Sox9, Kcnj2 and Kcnj16 locus from
E12.5 limb buds. Significant expression changes are in bold. FPKM,
fragments per kilobase of exon per million fragments mapped. Benjamini-
Hochberg-adjusted P value, n = 2, cut-off = 0.001.
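
The Benjamini-Hochberg adjustment named in this legend can be illustrated with a short sketch. This is a generic implementation of the published step-up procedure, not the authors' analysis pipeline; the function name `benjamini_hochberg` and the example P values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values, in the input order.

    Each P value is scaled by m / rank (rank 1 = smallest), then made
    monotone by taking a running minimum from the largest rank downwards.
    """
    m = len(pvalues)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical raw P values for four genes (illustration only).
example = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
# A gene would be called significant here if its adjusted value is below
# the 0.001 cut-off stated in the legend.
significant = [p < 0.001 for p in example]
```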

Extended Data Figure 8 | Deletion of the TAD boundary region in wild-type
and Dup-L mutant mice. a, Left, cHi-C from wild type (top) and
ΔBor (middle) E12.5 limb buds and subtraction map of ΔBor relative to
wild type (bottom). Deletion of the boundary between the Kcnj and Sox9
TADs leads to loss of insulation and ectopic interactions (*) between the
TADs. Note overall TAD structure remains intact and ectopic interaction
is restricted by remaining TAD boundaries. Right, cHi-C from Dup-L
(top) and Dup-LΔBor (middle) E12.5 limb buds and subtraction map of
Dup-LΔBor relative to Dup-L (bottom). Deletion of the two duplicated
boundary regions flanking the neo-TAD results in ectopic contacts of
duplicated sequences with adjacent Kcnj and Sox9 TADs (*) including
Sox9 and Kcnj2, as seen in ΔBor mutants. Loss of neo-TAD insulation
results in increased interactions of the duplicated sequences (**).
b, Expression analysis of Sox9 and Kcnj2 in wild-type, ΔBor and
Dup-LΔBor embryos. RNA-seq expression analysis of mutant versus
wild-type E12.5 limb buds. Kcnj2 is upregulated in ΔBor (heterozygous and
homozygous) and Dup-LΔBor (homozygous) limb buds (Benjamini-
Hochberg adjusted P values, n = 2). Whole-mount in situ hybridization
shows no site-specific misexpression of Kcnj2 in a Sox9-like pattern.

Two distinct RNase activities of CRISPR-C2c2 enable
guide-RNA processing and RNA detection


Alexandra East-Seletsky, Mitchell R. O'Connell, Spencer C. Knight, David Burstein,
Jamie H. D. Cate, Robert Tjian & Jennifer A. Doudna


Bacterial adaptive immune systems use CRISPRs (clustered
regularly interspaced short palindromic repeats) and CRISPR-
associated (Cas) proteins for RNA-guided nucleic acid cleavage.
Although most prokaryotic adaptive immune systems generally
target DNA substrates, type III and VI CRISPR systems direct
interference complexes against single-stranded RNA substrates.
In type VI systems, the single-subunit C2c2 protein functions as
an RNA-guided RNA endonuclease (RNase). How this enzyme
acquires mature CRISPR RNAs (crRNAs) that are essential for
immune surveillance and how it carries out crRNA-mediated RNA
cleavage remain unclear. Here we show that bacterial C2c2 possesses
a unique RNase activity responsible for CRISPR RNA maturation
that is distinct from its RNA-activated single-stranded RNA
degradation activity. These dual RNase functions are chemically
and mechanistically different from each other and from the crRNA-
processing behaviour of the evolutionarily unrelated CRISPR
enzyme Cpf1 (ref. 11). The two RNase activities of C2c2 enable
multiplexed processing and loading of guide RNAs that in turn allow
sensitive detection of cellular transcripts.

The first step of CRISPR immune surveillance requires the processing
of precursor crRNA transcripts (pre-crRNAs), consisting
of repeat sequences flanking viral spacer sequences, into individual


[Figure 1a, b panel labels: C2c2 homologue tree; Leptotrichia buccalis (Lbu); Listeria seeligeri (Lse); Leptotrichia shahii (Lsh)]
Figure 1 | C2c2 proteins process precursor crRNA transcripts to
generate mature crRNAs. a, Maximum-likelihood phylogenetic tree of
C2c2 proteins. Homologues used in this study are highlighted in yellow.
b, Diagram of the type VI CRISPR loci used in this study. Black rectangles
denote repeat elements, yellow diamonds denote spacer sequences. Cas1
and Cas2 are only found in the genomic vicinity of LshC2c2. c, C2c2-


Knight”, David Burstein’, Jamie H. D. Cate!?-4, 


mature crRNAs that each contain a single spacer. CRISPR systems
use three known mechanisms to produce mature crRNAs: a
dedicated endonuclease (for example, Cas6 or Cas5d in type I and III
systems), coupling of a host endonuclease (for example, RNase
III with a trans-activating crRNA in type II systems), or RNase
activity intrinsic to the effector enzyme itself (for example, Cpf1 in
type V systems).

Because type VI CRISPR loci lack an obvious Cas6- or Cas5d-like
endonuclease or trans-activating crRNA, we wondered whether
C2c2 itself might possess pre-crRNA processing activity, and if so,
whether the mechanism would be distinct from Cpf1, an unrelated
class 2 CRISPR effector that can process pre-crRNAs. Using purified
recombinant C2c2 protein homologues from three distinct
branches of the C2c2 protein family (Fig. 1a, b, Extended Data
Figs 1, 2), we found that all three enzymes cleave 5′-end radiolabelled
pre-crRNA substrates consisting of a full-length consensus
repeat sequence and a 20-nucleotide spacer sequence (Fig. 1c).
Mapping the cleavage site for each pre-crRNA-C2c2 homologue
pair revealed that processing occurs either two or five nucleotides
upstream of the predicted repeat-sequence hairpin structure,
depending on the C2c2 homologue (Fig. 1c, Extended Data Fig. 3a).
Notably, these mapped 5′-cleavage sites do not agree with the
mediated cleavage of pre-crRNA derived from the LbuC2c2, LseC2c2 and
LshC2c2 CRISPR repeat loci. OH, alkaline hydrolysis ladder; T1, RNase 
T1 hydrolysis ladder. Processing cleavage reactions were performed with 
100 nM C2c2 and <1 nM pre-crRNA. Schematic of cleavage is depicted on 
the right and predicted pre-crRNA secondary structures are shown below, 
with arrows indicating the mapped C2c2 cleavage sites (nt, nucleotides). 

Figure 2 | LbuC2c2-mediated crRNA biogenesis depends on both
structure and sequence of CRISPR repeats. a, Representative cleavage
assay by LbuC2c2 on pre-crRNAs containing structural mutations within
the stem and loop regions of hairpin. Processed percentages listed below
are quantified at 1 h (mean ± s.d., n = 3). b, Bar graph showing the
dependence of pre-crRNA processing on the CRISPR repeat sequence.
The wild-type (WT) repeat sequence is shown below with individual bars
representing tandem nucleotide mutations as noted in red. The cleavage
site is indicated by cartoon scissors. Percentage processed was measured
after 1 h (mean ± s.d., n = 3). c, Divalent metal ion dependence of the
crRNA processing reaction was tested by the addition of 10-50 mM EDTA
and EGTA to standard reaction conditions.


previously reported sites for Leptotrichia shahii (LshC2c2) or Listeria
seeligeri (LseC2c2) pre-crRNAs. Our analysis of the RNA sequencing
dataset in ref. 10 indicates agreement of the in vivo cleavage
site with the in vitro site reported here (Extended Data Fig. 3b-i).
Furthermore, cleavage assays using C2c2 from Leptotrichia buccalis
(LbuC2c2) and a larger pre-crRNA comprising a tandem hairpin-
repeat array yielded two products resulting from two separate cleavage
events (Extended Data Fig. 4a), consistent with a role for C2c2
in processing precursor crRNA transcripts generated from type VI
CRISPR loci.

To understand the substrate requirements and mechanism of C2c2
guide-RNA processing, we generated pre-crRNAs containing mutations
in either the stem-loop or the single-stranded flanking regions of
the consensus repeat sequence, and tested their ability to be processed
by LbuC2c2 (Fig. 2). C2c2-catalysed cleavage was attenuated upon
altering the length of the stem in the repeat region (Fig. 2a). Inversion
of the stem-loop or reduction of the loop length also reduced the
processing activity of C2c2, while contiguous 4-nucleotide mutations
including or near the scissile bond completely abolished it (Extended
Data Fig. 4b). More extensive mutational analysis of the full crRNA
repeat sequence revealed two distinct regions on either side of the hairpin
with marked sensitivity to base changes (Fig. 2b). By contrast, there
was no dependence on the spacer sequence for kinetics of processing
(Extended Data Fig. 4c). This sensitivity to both flanking regions of the
hairpin is reminiscent of the sequence and structural motifs required
by many Cas6 and Cas5d enzymes. Cpf1, however, does not have
any dependence on the 3′ hairpin flanking region, as the variable spacer
region abuts the hairpin stem.

The processing activity of LbuC2c2 was unaffected by the presence
of divalent metal ion chelators (Fig. 2c), indicative of a metal-ion-
independent RNA hydrolytic mechanism. Metal-ion-independent
RNA hydrolysis is typified by the formation of a 2′,3′-cyclic phosphate
and a 5′-hydroxide on the 5′ and 3′ halves of the crRNA cleavage products,
respectively. To determine the end-group chemical identity of
C2c2-processed substrates, we incubated the 5′ flanking products with
T4 polynucleotide kinase, which removes 2′,3′-cyclic phosphates to
leave a 3′-hydroxyl; this resulted in altered denaturing-gel migration of
the 5′ flanking product, consistent with the removal of a 3′ phosphate
group (Extended Data Fig. 4d). The divalent metal ion independence
of the C2c2 pre-crRNA processing activity is in stark contrast with the
divalent metal ion dependency of Cpf1, the only other single-protein
CRISPR effector shown to perform guide processing. Collectively,
these data indicate that C2c2-catalysed pre-crRNA cleavage is a divalent
metal-ion-independent process that probably uses a general acid-base
catalysis mechanism.

After maturation, crRNAs typically bind with high affinity to Cas
effector protein(s) to create RNA-guided surveillance complexes
capable of sequence-specific nucleic acid recognition. In agreement
with previous work on LshC2c2 (ref. 9), LbuC2c2 catalysed
efficient target RNA cleavage only when such substrates could base
pair with a complementary sequence in the crRNA (Extended Data
Figs 5-7). Given the promiscuous pattern of cleavage observed for
C2c2 (Extended Data Fig. 6), we tested the ability of LbuC2c2 to act as
a crRNA-activated non-specific RNA endonuclease in trans (Extended
Data Fig. 5b). In marked contrast to non-target cleavage experiments
performed in cis and consistent with observations for LshC2c2 (ref. 9),
we observed rapid degradation of non-target RNA (Extended Data
Fig. 5b); thus, target recognition activates C2c2 for general non-
specific degradation of RNA. Importantly, the similar cleavage rates
and near-identical cleavage products observed for both cis on-target
cleavage and trans non-target cleavage of the same RNA substrate
implicate the same nuclease centre in both activities (Extended Data
Fig. 5b).

crRNA-mediated cleavage of target single-stranded RNA (ssRNA)
occurs ~80-fold faster than pre-crRNA processing (Fig. 3a). In contrast
to pre-crRNA processing, RNA-guided target cleavage is abolished
in the presence of EDTA, indicating that this activity is divalent
metal ion-dependent (Fig. 3a, Extended Data Figs 5c, 7). Given these
clear differences, we reasoned that C2c2 might possess two orthogonal
RNA cleavage activities: crRNA maturation, and crRNA-directed, non-
specific RNA degradation. To test this hypothesis, we mutated
several residues within the conserved HEPN (higher
eukaryotes and prokaryotes nucleotide-binding domain) motifs of
LbuC2c2 (refs 9, 22-24) and assessed their pre-crRNA processing and
RNA-guided RNase activities (Fig. 3, Extended Data Fig. 7d). Double
and quadruple mutants (R472A, H477A, R1048A and H1053A)
retained robust pre-crRNA cleavage activity (Fig. 3c), but all HEPN
mutations abolished RNA-guided cleavage activity without affecting
crRNA or ssRNA binding ability (Extended Data Figs 7d, 8).

We sought mutations that would abrogate pre-crRNA processing
activity without disrupting target RNA cleavage. Given that we were
unable to predict any other potential RNase motifs beyond the HEPN
motifs, and that C2c2 proteins lack homology to Cpf1, we opted to
mutate charged residues throughout LbuC2c2 systematically. We identified
an arginine residue (R1079A) that upon mutation resulted in
severely attenuated pre-crRNA processing activity (Fig. 3c). This C2c2
mutant enzyme retained crRNA-binding ability as well as RNA target
cleavage activity (Extended Data Fig. 8d, Fig. 3d). These results show
that distinct active sites within the C2c2 protein catalyse pre-crRNA
processing and RNA-directed RNA cleavage.

Figure 3 | LbuC2c2 contains two distinct RNase
activities. a, Quantified time-course data of
cis ssRNA target (black) and pre-crRNA (teal)
cleavage by LbuC2c2. Exponential fits are shown
as solid lines (n = 3), and the calculated pseudo-
first-order rate constants (kobs) (mean ± s.d.)
are 9.74 ± 1.15 min⁻¹ and 0.12 ± 0.02 min⁻¹
for cis ssRNA target and pre-crRNA cleavage,
respectively. b, LbuC2c2 architecture depicting
the location of HEPN motifs and processing-
deficient point mutant. c, d, Representative
ribonuclease activity of LbuC2c2 mutants for

We recognized that the robust RNA-stimulated cleavage of substrates
by C2c2 in trans might be used to detect specific RNAs within a pool of
transcripts. While many polymerase-based methods have been developed
for RNA amplification and subsequent detection, few approaches
are able to detect target RNA directly without considerable engineering
or stringent design constraints for each new RNA target. As a
readily programmable alternative, we tested whether the RNA-guided
trans-endonuclease activity of C2c2 could be harnessed to cleave a
fluorophore quencher-labelled reporter RNA, thereby resulting in
increased fluorescence upon target-RNA-triggered RNase activation
(Fig. 4a). LbuC2c2 was loaded with bacteriophage λ-targeting crRNAs
and tested for its ability to detect the corresponding λ ssRNA targets
spiked into HeLa cell total RNA. Upon addition of as little as 1-10 pM
complementary λ target RNA, a substantial crRNA-specific fluorescence
increase occurred within 30 min (Fig. 4b, Extended Data Fig.
9a). Control experiments with the C2c2-crRNA complex either alone
or in the presence of crRNA and a non-complementary target RNA
resulted in negligible increases in fluorescence relative to a positive
control (Fig. 4b, Extended Data Fig. 9a). At 10 pM concentration of a
λ target RNA, only ~0.02% of the C2c2-crRNA complex is predicted
to be in the active state, yet the observed fluorescent signal reflected
~25-50% cleavage of the reporter RNA substrate, depending on the
RNA target. Fragment size resolution of the background RNA in these
reactions revealed considerable degradation, even on highly structured
tRNAs (Extended Data Fig. 9b). Since reporter RNA cleavage occurs
in the presence of a vast excess of unlabelled RNA, we conclude that
LbuC2c2 is a robust multiple-turnover enzyme capable of at least 10⁴
turnovers per target RNA recognized. Thus, in contrast to previous
observations, crRNA-directed trans cleavage is potent and detectable
even at extremely low levels of activated protein.
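
The ~0.02% figure quoted above can be reproduced with simple arithmetic. The sketch below is illustrative only: it assumes that each target RNA molecule activates exactly one C2c2-crRNA complex and that the complex concentration is limited by the 50 nM crRNA stated in the Methods. The function name `percent_active_complex` is hypothetical.

```python
PICOMOLAR = 1e-12   # mol per litre
NANOMOLAR = 1e-9    # mol per litre


def percent_active_complex(target_molar, complex_molar):
    """Percentage of C2c2-crRNA complex predicted to be target-bound.

    Assumption (not from the source): binding is stoichiometric, so the
    active fraction is simply target / complex, capped at 100%.
    """
    if complex_molar <= 0:
        raise ValueError("complex concentration must be positive")
    return 100.0 * min(target_molar, complex_molar) / complex_molar


# 10 pM target RNA against 50 nM C2c2-crRNA complex.
active_pct = percent_active_complex(10 * PICOMOLAR, 50 * NANOMOLAR)
```

Under these assumptions one complex in every 5,000 is active, which is why the observed 25-50% reporter cleavage implies many turnovers per activated enzyme.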

To extend this LbuC2c2 RNA detection system, we designed a
crRNA to target endogenous β-actin mRNA. We observed a measurable
fluorescence increase in the presence of human total RNA relative
to Escherichia coli total RNA, demonstrating the specificity of
this method (Fig. 4c). Furthermore, given that C2c2 processes its own
guide, we combined pre-crRNA processing and RNA detection in a
single reaction by designing tandem crRNA-repeat-containing spacers
complementary to target RNAs λ and λ2. LbuC2c2 incubated with
this unprocessed tandem guide RNA in the detection assay generated
a substantial fluorescence increase similar in magnitude and sensitivity
to experiments using mature crRNAs (Fig. 4b, d). Taken together,
these data highlight the exciting opportunity to take advantage of the
two distinct RNase activities of C2c2 for a range of biotechnological
applications (Fig. 4e).

In bacteria, C2c2 probably operates as a sentinel for viral RNAs.
We propose that when invasive transcripts are detected within the
host cell via base pairing with crRNAs, C2c2 is activated for promiscuous
cleavage of RNA in trans (Fig. 4e). This bears notable similarity
to RNase L and caspase defence mechanisms in eukaryotes, in which
a cellular signal triggers promiscuous ribonucleolytic and proteolytic
degradation, respectively, within the host cell, leading to apoptosis.
While the RNA targeting mechanisms of type III CRISPR systems generally
result in RNA cleavage within the protospacer-guide duplex,
recent examples of associated nucleases Csx1 (ref. 23) and Csm6
(ref. 24) provide compelling parallels between the type VI systems and
the multi-component type III interference complexes.

Our data show that CRISPR C2c2 proteins represent a class of 
enzyme capable of two separate RNA recognition and cleavage 
activities. Efficient pre-crRNA processing requires sequence and
structural motifs within the CRISPR repeat that prevent non-endogenous
crRNA loading and help to reduce the potential toxicity
of this potent RNase. The entirely different pre-crRNA processing
mechanisms of C2c2 and the type V CRISPR effector protein Cpf1
indicate that each protein family has converged upon independent
activities encompassing both the processing and interference func- 
tions of their respective CRISPR pathways. Furthermore, the two 
distinct catalytic capabilities of C2c2 can be harnessed together for 
RNA detection, as the activation of C2c2 to cleave thousands of trans 
RNAs for every target RNA detected enables potent signal amplifica- 
tion. The capacity of C2c2 to process its own guide RNAs from arrays 
could also allow the use of tissue-specific polymerase II promoters for 
guide expression, in addition to target multiplexing for a wide range 
of applications. The C2c2 enzyme is unique within bacterial adaptive 
immunity for its dual RNase activities, and highlights the utility of 
harnessing CRISPR proteins for precise nucleic acid manipulation in 
cells and cell-free systems. 

Figure 4 | C2c2 provides sensitive detection of transcripts in complex
mixtures. a, Illustration of LbuC2c2 RNA detection approach using a
quenched fluorescent RNA reporter. b, Quantification of fluorescence
signal generated after 30 min by wild-type or catalytically dead
(dHEPN1/2) LbuC2c2 loaded with either a λ2-, λ3- or λ4-targeting
crRNA (cr-; as indicated) in the presence of varying concentrations of
λ2-4 target ssRNA and human total RNA. RNase A shown as positive
RNA degradation control (mean ± s.d., n = 3). AU, arbitrary units. c,
Quantification of fluorescence signal generated by LbuC2c2 loaded with a
β-actin targeting crRNA after 3 h for varying amounts of human total RNA
or bacterial total RNA (as a β-actin-null negative control) (mean ± s.d.,
n = 3). d, Tandem pre-crRNA processing also enables RNA detection
(mean ± s.d., n = 3). e, Model of the type VI CRISPR pathway highlighting
both of the C2c2 RNase activities.

METHODS

No statistical methods were used to predetermine sample size. The experiments
were not randomized, and investigators were not blinded to allocation during
experiments and outcome assessment.

C2c2 phylogenetic and candidate selection. C2c2 maximum-likelihood phylogenies
were computed using RAxML with the PROTGAMMALG evolutionary
model and 100 bootstrap samplings. Sequences were aligned by MAFFT with the
'einsi' method.

C2c2 protein production and purification. Expression vectors for protein purification
were assembled using synthetic gBlocks ordered from Integrated DNA
Technologies. The codon-optimized C2c2 genomic sequence was N-terminally
tagged with a His6-MBP-TEV cleavage site, with expression driven by a T7 promoter.
Mutant proteins were cloned via site-directed mutagenesis of wild-type
C2c2 constructs. Expression vectors were transformed into Rosetta2 E. coli cells
grown in 2xYT broth at 37 °C. E. coli cells were induced during log phase with
0.5 mM IPTG, and the temperature was reduced to 16 °C for overnight expression
of His-MBP-C2c2. Cells were subsequently harvested, resuspended in lysis buffer
(50 mM Tris-HCl pH 7.0, 500 mM NaCl, 5% glycerol, 1 mM TCEP, 0.5 mM PMSF,
and EDTA-free protease inhibitor (Roche)) and lysed by sonication, and the lysates
were clarified by centrifugation. Soluble His-MBP-C2c2 was isolated over metal
ion affinity chromatography, and protein-containing eluate was incubated with
TEV protease at 4 °C overnight while dialyzing into ion exchange buffer (50 mM
Tris-HCl pH 7.0, 250 mM KCl, 5% glycerol, 1 mM TCEP) in order to cleave off the
His6-MBP tag. Cleaved protein was loaded onto a HiTrap SP column and eluted
over a linear KCl (0.25-1.5 M) gradient. Cation exchange chromatography fractions
were pooled and concentrated with 30 kDa cutoff concentrators (Thermo Fisher).
The C2c2 protein was further purified via size-exclusion chromatography on an
S200 column and stored in gel filtration buffer (20 mM Tris-HCl pH 7.0, 200 mM
KCl, 5% glycerol, 1 mM TCEP) for subsequent enzymatic assays. Expression plasmids
are deposited with Addgene.

Generation of RNA. All RNAs used in this study were transcribed in vitro
except for crRNA AES461, which was ordered synthetically (Integrated DNA
Technologies) (see Extended Data Table 1, Extended Data Fig. 6). In vitro transcription
reactions were performed as previously described with the following
modifications: the T7 polymerase concentration was reduced to 10 µg ml⁻¹, and
the UTP concentration was reduced to 2.5 mM. Transcriptions were incubated at
37 °C for 12 h to reduce non-template addition of nucleotides and quenched via
treatment with DNase I at 37 °C for 0.5-1 h. Transcription reactions were purified
by 15% denaturing polyacrylamide gel electrophoresis (PAGE), and all RNAs were
resuspended in cleavage buffer (20 mM HEPES pH 6.8, 50 mM KCl, 5 mM MgCl2
and 5% glycerol). For radioactive experiments, 5′ triphosphates were removed by
calf intestinal phosphatase (New England Biolabs) before radiolabelling and ssRNA
substrates were then 5′-end labelled using T4 polynucleotide kinase (New England
Biolabs) and [γ-32P]ATP (Perkin Elmer) as described previously.

Pre-crRNA processing assays. Pre-crRNA cleavage assays were performed at
37 °C in RNA processing buffer (20 mM HEPES pH 6.8, 50 mM KCl, 5 mM MgCl2,
10 µg ml⁻¹ BSA, 10 µg ml⁻¹ tRNA, 0.05% Igepal CA-630 and 5% glycerol) with a
100-fold molar excess of C2c2 relative to 5′-labelled pre-crRNA (final concentrations
of 100 nM and <1 nM, respectively). Unless otherwise indicated, the reaction
was quenched after 1 h with 1.5× RNA loading dye (100% formamide, 0.025%
(w/v) bromophenol blue and 200 µg ml⁻¹ heparin). After quenching, reactions
were denatured at 95 °C for 5 min before resolving by 12% or 15% denaturing PAGE
(0.5× TBE buffer). Metal dependence of the reaction was tested by addition of
EDTA or EGTA to the reaction buffer at concentrations varying from 10 to 100 mM.

Bands were visualized by phosphorimaging and quantified with ImageQuant (GE
Healthcare). The percentage cleavage was determined as the ratio of the product
band intensity to the total intensity of both the product and uncleaved
pre-crRNA bands and normalized for background within each measured substrate
using ImageQuant TL Software (GE Healthcare) and fit to a one phase exponential
association using Prism (GraphPad).
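
The quantification described above can be sketched in a few lines. This is an illustrative sketch, not the ImageQuant or Prism implementation: the function names, the simple background subtraction and the fixed 100% plateau are assumptions. The two rate constants are the kobs values reported in the Fig. 3a legend.

```python
import math


def percent_cleaved(product, uncleaved, background=0.0):
    """Product band intensity as a percentage of product plus uncleaved.

    Assumption: background is a single value subtracted from each band.
    """
    p = max(product - background, 0.0)
    u = max(uncleaved - background, 0.0)
    if p + u == 0:
        raise ValueError("no signal above background")
    return 100.0 * p / (p + u)


def one_phase_association(t_min, k_obs, plateau=100.0):
    """One-phase exponential association: y(t) = plateau * (1 - exp(-k*t))."""
    return plateau * (1.0 - math.exp(-k_obs * t_min))


def half_time(k_obs):
    """Time (min) at which the curve reaches half of its plateau."""
    return math.log(2) / k_obs


# Pseudo-first-order rate constants from the Fig. 3a legend (min^-1).
K_TARGET = 9.74       # cis ssRNA target cleavage
K_PROCESSING = 0.12   # pre-crRNA processing
fold_faster = K_TARGET / K_PROCESSING
```

The ratio of the two constants is about 81, consistent with the ~80-fold difference quoted in the main text.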

Product size mapping and 3′ end moiety identification. Cleavage product length
was determined biochemically by comparing gel migration of product bands to
alkaline hydrolysis and RNase T1 digestion ladders using the RNase T1 Kit from
Ambion. For the hydrolysis ladder, 15 nM full-length RNA substrates were incubated
at 95 °C in 1× alkaline hydrolysis buffer (Ambion) for 5 min. Reactions were
quenched with 1.5× RNA loading buffer, and cooled to -20 °C to immediately stop
hydrolysis. For the RNase T1 ladder, 15 nM full-length RNA substrates were unfolded
in 1× RNA sequencing buffer (Ambion) at 65 °C. Reactions were cooled to ambient
temperature, and then 1 U of RNase T1 (Ambion) was added to reaction. After
15 min, reactions were stopped by phenol-chloroform extraction and 1.5× RNA
loading buffer was added for storage. Hydrolysis bands were resolved in parallel to
cleavage samples on 15% denaturing PAGE and visualized by phosphorimaging.
For 3′ end moiety identification, products from the processing reaction were incubated
with 10 U of T4 polynucleotide kinase (New England Biolabs) for 1 h at 37 °C
in processing buffer. Reactions were quenched with 1.5× RNA loading buffer,
resolved on 20% denaturing PAGE and visualized by phosphorimaging.

Small RNA sequencing analysis. RNA reads from ref. 10 were downloaded from
SRA runs SRR3713697, SRR3713948 and SRR3713950. The paired-end reads
were locally mapped to the reference sequences using Bowtie2 (ref. 34) with the
following options: '--reorder --very-fast-local --local'. The mapping was then filtered
to retain only alignments that contained no mismatch using mapped.py (https://
github.com/christophertbrown/mapped) with the '-m 0 -p both' options. BAM
files of the resulting mapping are in the Supplementary Information. Read coverage
was visualized using Geneious and plotted using Prism (GraphPad).
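
The zero-mismatch filtering step can be illustrated with a minimal sketch that works on SAM-formatted text lines. It is not the mapped.py script cited above and does not reproduce its options: it assumes the aligner wrote the standard `NM:i:` edit-distance tag, and it filters each read on its own, whereas the '-p both' option presumably applies to read pairs. The function names are hypothetical.

```python
def edit_distance(sam_line):
    """Return the NM:i: edit distance of a SAM record, or None if absent."""
    fields = sam_line.rstrip("\n").split("\t")
    for field in fields[11:]:          # optional tags start at column 12
        if field.startswith("NM:i:"):
            return int(field[5:])
    return None


def is_mapped(sam_line):
    """True if the 0x4 (segment unmapped) bit of the FLAG is not set."""
    flag = int(sam_line.split("\t")[1])
    return not (flag & 0x4)


def keep_perfect_alignments(sam_lines):
    """Yield header lines and mapped records with an edit distance of 0."""
    for line in sam_lines:
        if line.startswith("@"):       # header lines pass through unchanged
            yield line
        elif is_mapped(line) and edit_distance(line) == 0:
            yield line
```

Because NM counts indels as well as mismatches, this filter is slightly stricter than "no mismatch" alone; soft-clipped bases from local alignment are not counted.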

Target cleavage assays. Target cleavage assays were performed at 25 °C or 37 °C in
cleavage buffer (20 mM HEPES pH 6.8, 50 mM KCl, 5 mM MgCl2 and 5% glycerol).
crRNA guides were pre-folded by heating to 65 °C for 5 min and then slowly cooling
to ambient temperature in cleavage buffer. C2c2-crRNA complex formation
was performed in cleavage buffer, generally at a molar ratio of 2:1 protein to crRNA
at 37 °C for 10 min, before adding 5′-end labelled target and/or other
non-radiolabelled RNA target substrates. Unless otherwise indicated, final concentrations
of protein, guide and targets were 100 nM, 50 nM and <1 nM, respectively,
for all reactions. Reactions were quenched with 1.5× RNA loading dye and
resolved by 15% denaturing PAGE (0.5× TBE buffer). Bands were visualized by
phosphorimaging and quantified with ImageQuant (GE Healthcare). The percentage
cleavage was determined as the ratio of total banding intensity for all shorter
products relative to the uncleaved band and normalized for background within
each measured substrate using ImageQuant TL Software (GE Healthcare) and fit
to a one phase exponential association using Prism (GraphPad).

crRNA filter-binding assays. Filter binding assays were carried out in RNA
processing buffer (20 mM HEPES pH 6.8, 50 mM KCl, 5 mM MgCl2, 10 µg ml⁻¹ BSA,
10 µg ml⁻¹ yeast tRNA, 0.01% Igepal CA-630 and 5% glycerol). LbuC2c2 was
incubated with radiolabelled crRNA (<0.1 nM) for 1 h at 37°C. Tufryn, Protran and
Hybond-N+ were assembled onto a dot-blot apparatus in the order listed above.
The membranes were washed twice with 50 µl equilibration buffer (20 mM HEPES
pH 6.8, 50 mM KCl, 5 mM MgCl2 and 5% glycerol) before the sample was applied
to the membranes. Membranes were again washed with 50 µl equilibration buffer,
dried and visualized by phosphorimaging. Data were quantified with ImageQuant
TL Software (GE Healthcare) and fit to a binding isotherm using Prism (GraphPad
Software). All experiments were carried out in triplicate. Dissociation constants
and associated errors are reported in the figure legends.
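
The text does not state which isotherm was fitted. As an illustration only, a standard single-site hyperbolic isotherm (an assumption; the symbols \theta, B_{\max} and [P] are introduced here for the fraction of RNA bound, the plateau and the protein concentration) would read:

```latex
\theta([P]) = \frac{B_{\max}\,[P]}{K_{\mathrm{d}} + [P]}
```

Under this form, K_d is the protein concentration at which binding reaches half of B_max. The approximation holds when the labelled RNA is well below K_d, which is consistent with the <0.1 nM crRNA used here.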

Electrophoretic mobility-shift assays. To avoid the dissociation of the
LbuC2c2-dHEPN1/dHEPN2-crRNA complex at low concentrations during ssRNA-binding
experiments, binding reactions contained a constant excess of
LbuC2c2-dHEPN1/dHEPN2 (200 nM), and increasing concentrations of crRNA-A and
<0.1 nM target ssRNA. Assays were carried out in C2c2 EMSA buffer (20 mM HEPES
pH 6.8, 50 mM KCl, 10 µg ml⁻¹ BSA, 100 µg ml⁻¹ yeast tRNA, 0.01% Igepal CA-630 and
5% glycerol). LbuC2c2-crRNA-A complexes were pre-formed as described above
for 10 min at 37°C before the addition of 5'-radiolabelled ssRNA substrate and a
further incubation for 45 min at 37°C. Samples were then resolved by 8% native
PAGE at 4°C (0.5× TBE buffer). Gels were imaged by phosphorimaging, quantified
using ImageQuant TL Software (GE Healthcare) and fit to a binding isotherm
using Prism (GraphPad Software). All experiments were carried out in triplicate.
Dissociation constants and associated errors are reported in the figure legends.

Fluorescent RNA detection assay. LbuC2c2-crRNA complexes were preassembled
by incubating 1 µM of LbuC2c2 with 500 nM of crRNA for 10 min at 37°C. These
complexes were then diluted to 100 nM LbuC2c2, 50 nM crRNA, in RNA
processing buffer (20 mM HEPES pH 6.8, 50 mM KCl, 5 mM MgCl2, 10 µg ml⁻¹ BSA,
10 µg ml⁻¹ yeast tRNA, 0.01% Igepal CA-630 and 5% glycerol) in the presence of
185 nM of RNAase-Alert substrate (Thermo-Fisher), 100 ng of HeLa total RNA and
increasing amounts of target 60-nucleotide ssRNA (0-1 nM). These reactions were
incubated in a fluorescence plate reader (Tecan Infinite Pro F200) for up to 120 min
at 37°C with fluorescence measurements taken every 5 min (λex: 485 nm; λem:
535 nm). Background-corrected fluorescence values were obtained by subtracting
fluorescence values obtained from reactions carried out in the absence of target
ssRNA. Maximal fluorescence was measured by incubating 50 nM RNase A with
185 nM of RNAase-Alert substrate. For measurement of crRNA-ACTB-mediated
LbuC2c2 activation by β-actin mRNA in human total RNA, LbuC2c2-crRNA
complexes were preassembled by incubating 1 µM of LbuC2c2 with 500 nM of
crRNA-ACTB for 10 min at 37°C and reactions were carried out in the conditions
above in the presence of increasing amounts (0-1 µg) of either HeLa cell total RNA
or E. coli total RNA (as a negative control). These reactions were incubated in a
fluorescence plate reader for up to 180 min at 37°C with fluorescence measurements
taken every 5 min (λex: 485 nm; λem: 535 nm). Background-corrected fluorescence
values were obtained by subtracting fluorescence values obtained from reactions
carried out in the absence of target ssRNA. For coupled pre-crRNA processing and
RNA detection assays, LbuC2c2-crRNA complexes were preassembled by
incubating 1 µM of LbuC2c2 with 500 nM of pre-crRNA-A-λ2 for 20 min at 37°C and
reactions carried out as described above in the presence of increasing amounts
of ssRNA A and ssRNA λ2 (0-1 nM each). In each case, error bars represent the
standard deviation from three independent experiments.
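
The background correction and replicate statistics described above amount to simple arithmetic, sketched here under stated assumptions: each trace is a list of plate-reader values at the same time points, the no-target trace is subtracted point by point, and the error bar is the sample standard deviation (n - 1 denominator) across the three replicates. The text does not say which s.d. estimator was used, and the function names are hypothetical.

```python
from statistics import mean, stdev


def background_corrected(with_target, no_target):
    """Subtract the no-target trace from the target trace, time point by time point."""
    if len(with_target) != len(no_target):
        raise ValueError("traces must cover the same time points")
    return [signal - blank for signal, blank in zip(with_target, no_target)]


def mean_and_sd(replicates):
    """Per-time-point mean and sample s.d. across replicate traces."""
    columns = list(zip(*replicates))      # regroup values by time point
    return [mean(c) for c in columns], [stdev(c) for c in columns]
```

Dividing the corrected values by the RNase A maximal-fluorescence reading would express them as a fraction of full substrate cleavage, if that normalization is wanted.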

Background cleavage in total RNA. LbuC2c2-crRNA-λ4 complexes were
assembled as previously described for the fluorescence RNA detection assay.
Complexes were incubated in RNA processing buffer in the presence of 3 µg
total RNA with and without 10 nM λ4 ssRNA target. After 2 h, RNA was
isolated by trizol extraction and ethanol precipitation. The RNA fragment size
distribution of resuspended samples was resolved using Small RNA Analysis
Kit (Agilent) on a Bioanalyzer 2100 (Agilent) using the manufacturer's protocol.
Fluorescent intensity curves were normalized in Prism (GraphPad Software)
for curve overlay.
Extended Data Figure 1 | Complete phylogenetic tree of C2c2 family
and C2c2 alignment. a, Maximum-likelihood phylogenetic reconstruction
of C2c2 proteins. Leaves include GI protein numbers and organism of
origin; bootstrap support values, out of 100 resamplings, are presented
for inner split. Scale is in substitutions per site. b, Multiple sequence
alignment of the three analysed homologues of C2c2; coordinates are
based on LbuC2c2.
Extended Data Figure 2 | Purification and production of C2c2. All C2c2
homologues were expressed in E. coli as His-MBP fusions and purified by a
combination of affinity, ion exchange and size exclusion chromatography.
The Ni+ affinity tag was removed by incubation with TEV protease.
a, b, Representative SDS-PAGE gels of chromatography fractions
are shown. c, The chromatogram from Superdex 200 (16/60) column
demonstrating that C2c2 elutes as a single peak, devoid of nucleic acid.
d, SDS-PAGE analysis of purified proteins used.
Expected sizes (from figure panel): Lbu, 140 kD; Lsh, 168 kD; Lse, 144 kD + 40 kD MBP.
Extended Data Figure 3 | Mapping of pre-crRNA processing by C2c2
in vitro and in vivo. a, Cleavage site mapping of LseC2c2 and LshC2c2
cleavage of a single cognate pre-crRNA array. Cleavage reactions were
performed with 100 nM C2c2 and <1 nM pre-crRNA. b-i, Re-analysis
of LshC2c2 (b-f) and LseC2c2 (g-i) CRISPR array RNA sequencing
experiments from ref. 10 (supplementary figs S7 and 5 of ref. 10,
respectively). All reads (b, g) and filtered reads (55 nucleotides or less,
as per the original analysis in ref. 10; c, h) were stringently aligned to each
CRISPR array using Bowtie2 (see Methods). Detailed views of individual CRISPR
repeat-spacers are shown for Lsh (d-f) and Lse (i). Differences in 5' end
pre-crRNA processing are indicated by arrows below each sequence. BAM
alignment files of our analysis are available in Supplementary Information.




This mapping clearly indicates that the 5' ends of small RNA sequencing
reads generated from Lsh pre-crRNAs map to a position 2 nucleotides
from the base of the predicted hairpin, in agreement with our in vitro
processing data (a). This pattern holds for all mature crRNAs detected
from both native expression in L. shahii and heterologous expression in
E. coli (data not shown, BAM file available in Supplementary Information).
Unfortunately, the LseC2c2 crRNA sequencing data (used in g-i) is less
informative owing to low read depth, and each aligned crRNA exhibits a
slightly different 5' end with little obvious uniformity. The mapping for
one of the processed repeats (repeat-spacer 2; i) is in agreement with our
data but only with low confidence due to the insufficient read depth.
Extended Data Figure 4 | Pre-crRNA processing by C2c2 is
spacer-sequence independent, can occur on tandem crRNA arrays, is affected
by mutations in the 5' flanking region of the pre-crRNA and produces a
3' phosphate product. a, Cleavage site mapping of LbuC2c2 cleavage
of a tandem pre-crRNA array. Cleavage reactions were performed with
100 nM LbuC2c2 and <1 nM pre-crRNA. A schematic of cleavage
products is depicted on right, with arrows indicating the mapped C2c2
cleavage products. b, LbuC2c2 4-mer mutant pre-crRNA processing data
demonstrating the importance of the 5' single-stranded flanking region
for efficient pre-crRNA processing. Percentage of pre-crRNA processing
was measured after 1 h (mean ± s.d., n = 3). c, Representative LbuC2c2
pre-crRNA cleavage time course demonstrating that similar rates of
pre-crRNA processing occur independent of crRNA spacer sequence;
pseudo-first-order rate constants (kobs) (mean ± s.d.) are 0.07 ± 0.04 min⁻¹ and
0.08 ± 0.04 min⁻¹ for spacer A and spacer λ2, respectively. d, End group
analysis of cleaved RNA by T4 polynucleotide kinase (PNK) treatment.
Standard processing assay conditions were used to generate cleavage
product, which was then incubated with PNK for 1 h to remove any
2',3'-cyclic phosphates/3' monophosphates. Retarded migration of band
indicates removal of the charged monophosphate from the 3' end of
radiolabelled 5' product.
Extended Data Figure 5 | LbuC2c2 catalyses guide-dependent ssRNA
degradation on cis and trans targets. a, Schematic of the two modes of
C2c2 guide-dependent ssRNA degradation. b, Cleavage of two distinct
radiolabelled ssRNA substrates, A and B, by LbuC2c2. Complexes of
100 nM C2c2 and 50 nM crRNA were pre-formed at 37°C, and reaction
was initiated upon addition of <1 nM 5'-labelled target RNA at 25°C.
Trans cleavage reactions contained equimolar (<1 nM) concentrations of
radiolabelled non-guide-complementary substrate and unlabelled
on-target ssRNA. For multiple ssRNA substrates, we observed that LbuC2c2
catalysed efficient cleavage only when bound to the complementary
crRNA, indicating that LbuC2c2-crRNA cleaves ssRNA in an
RNA-guided fashion. This activity is hereafter referred to as on-target or
cis-target cleavage. LbuC2c2-mediated cis cleavage resulted in a laddering
of multiple products, with cleavage preferentially occurring before uracil
residues, analogous to LshC2c2 (ref. 9). We repeated non-target cleavage
reactions in the presence of unlabelled, on-target (crRNA-complementary)
ssRNA. In contrast to non-target cleavage experiments performed in cis,
we observed rapid degradation of non-target RNA in trans. The similar
RNA cleavage rates and near-identical cleavage products observed for both
cis on-target cleavage and trans non-target cleavage implicate the same
nuclease centre in both activities. c, LbuC2c2 loaded with crRNA targeting
spacer A was tested for cleavage activity under both cis (target A labelled)
and trans (target B labelled in the presence of unlabelled target A) cleavage
conditions in the presence of 25 mM EDTA.
Extended Data Figure 6 | LbuC2c2 ssRNA target cleavage site mapping.
a, ssRNA target cleavage assay conducted per Methods demonstrating
LbuC2c2-mediated 'cis' cleavage of several radiolabelled ssRNA substrates
with identical spacer-complementary sequences but distinct 5' flanking
sequences of variable length and nucleotide composition. Sequences of
ssRNA substrates are shown to the right with spacer-complementary
sequences for crRNA-A highlighted in yellow. Arrows indicate detected
cleavage sites. Gel was cropped for clarity. It should be noted that the
pattern of cleavage products produced on different substrates (for
example, A.1 versus A.2 versus A.3) indicates that the cleavage site choice
is primarily driven by a uracil preference and exhibits an apparent lack
of exclusive cleavage mechanism within the crRNA-complementary
target sequence, which is in contrast to what is observed for other class
II CRISPR single effector complexes such as Cas9 and Cpf1 (refs 11,
21). Notably, the cleavage pattern observed for substrate A.0 hints at a
secondary preference for polyG sequences. b, LbuC2c2 ssRNA target
cleavage assay as per Methods, using a range of crRNAs that tile the length
of the ssRNA target. The sequence of the ssRNA substrates used in this
experiment is shown below the gel with spacer-complementary sequences
for each crRNA highlighted in yellow. Arrows indicate predicted cleavage
sites. Above each set of lanes, a small diagram indicates the location of the
spacer sequence along the target (yellow box) and the cleavage products
observed (red arrows) or absent (black arrows). Likewise, it should be
noted that for every crRNA the cleavage product length distribution is
very similar, again indicating an apparent lack of exclusive cleavage within
the crRNA-bound sequence. The absence of several cleavage products
in a subset of the reactions might be explained by the presence of bound
C2c2-crRNA on the ssRNA target, which could sterically occlude access
to uracils by any cis (intramolecular) or trans (intermolecular) LbuC2c2
active sites. While proper analysis for protospacer flanking site (PFS)
preference for LbuC2c2 is beyond the scope of this study, minimal impact
of the 3' flanking nucleotide was observed. Expected PFS base is noted in
red in the diagram next to each guide tested.
Extended Data Figure 7 | Dependence of RNA targeting on crRNA
variants, temperature and point mutations. a, LbuC2c2 ssRNA target
cleavage assay carried out as per Methods with crRNAs possessing 16-,
20- or 24-nucleotide spacers. b, LbuC2c2 ssRNA target cleavage time
course carried out at either 25°C or 37°C as per the Methods. c, LbuC2c2
ssRNA target cleavage time course carried out as stated in the Methods
with crRNAs possessing different 5'-flanking nucleotide mutations.
Mutations are highlighted in red. One- to two-nucleotide 5' extensions
negligibly impacted cleavage efficiencies. By contrast, shortening the
flanking region to 3 nucleotides slowed cleavage rates. d, Effect of point
mutations on RNase activity of C2c2 in conserved residue mutants within
HEPN motifs for ssRNA targeting.
Extended Data Figure 8 | Binding data for LbuC2c2 to mature
crRNA and target ssRNA. a, Filter binding assays were conducted as
described in the Methods to determine the binding affinity of mature
crRNA-A_GG to LbuC2c2-WT, LbuC2c2-dHEPN1, LbuC2c2-dHEPN2,
or LbuC2c2-dHEPN1/dHEPN2. The quantified data were fit to standard
binding isotherms. Error bars represent the s.d. from three independent
experiments. Measured dissociation constants from three independent
experiments (mean ± s.d.) were 27.1 ± 7.5 nM (LbuC2c2-WT), 15.2 ± 3.2 nM
(LbuC2c2-dHEPN1), 11.5 ± 2.5 nM (LbuC2c2-dHEPN2), and
43.3 ± 11.5 nM (LbuC2c2-dHEPN1/dHEPN2). b, Representative
electrophoretic mobility shift assay for binding reactions between
LbuC2c2-dHEPN1/dHEPN2:crRNA-A_GG and either 'on-target'
A ssRNA or 'off-target' B ssRNA, as indicated. Three independent
experiments were conducted as described in the Methods. The gel
was cropped for clarity. c, Quantified binding data from b were fitted
to standard binding isotherms. Error bars represent the s.d. from three
independent experiments. Measured dissociation constants from three
independent experiments (mean ± s.d.) were 1.62 ± 0.43 nM for ssRNA
A and not determined (N.D.; >>10 nM) for ssRNA B. d, Filter binding
assays were conducted as described in the Methods to determine the
binding affinity of mature crRNA-A_GA to LbuC2c2-WT and
LbuC2c2-R1079A. The quantified data were fit to standard binding isotherms. Error
bars represent the s.d. from three independent experiments. Measured
dissociation constants from three independent experiments (mean ± s.d.)
were 4.65 ± 0.6 nM (LbuC2c2-WT) and 2.52 ± 0.5 nM (LbuC2c2-R1079A).
It is of note that these binding affinities differ from those in a. This difference is
accounted for by a slight difference in the 5' sequence of the guide, with
panel a guides beginning with 5'-G(G)CCA... and panel d guides with 5'-G(A)CCA...
While the native sequence guide (5'-G(A)CCA) binds tighter to LbuC2c2,
no difference is seen in the RNA targeting efficiencies of these guide
variants (Extended Data Fig. 6c).
Extended Data Figure 9 | RNase detection assay λ2-ssRNA time
course. a, LbuC2c2:crRNA-λ2 was incubated with RNAase-Alert
substrate (Thermo-Fisher) and 100 ng HeLa total RNA in the presence of
increasing amounts of λ2 ssRNA (0-1 nM) for 2 h at 37°C. Fluorescence
measurements were taken every 5 min. The 1 nM λ2 ssRNA reaction
reached saturation before the first time point could be measured. Error
bars represent the s.d. from three independent experiments. b,
LbuC2c2:crRNA-λ4 or apo LbuC2c2 was incubated in HeLa total RNA for 2 h in
the presence or absence of on-target activating λ4 ssRNA. Degradation of
background small RNA was resolved on a small RNA chip in a Bioanalyzer
2100 as described in the Methods. Small differences are seen in the
fragment profile between apo LbuC2c2 and LbuC2c2:crRNA-λ4. By
contrast, upon addition of the on-target ssRNA to the reaction, a notable
broadening and shifting of the tRNA peak reveals extensive degradation
of other structured and nonstructured RNAs present in the reaction upon
activation of LbuC2c2 trans activity.
Extended Data Table 1 | Oligonucleotides used in this study


Name Sequence Used in 
bu. pre-cRINA_A_ SOK 5" -GGAUUUAGACCACCCCAAAAAUGAAGGSGACUAAAACAGUGCCAGAGAUGAUGACCCU-3: Fig. 1, 28-6, 386, EOS 
Lse_pre-cRNA_B_AESHOA 5° -GGUAAGAGACUACCUCUAUAUGAAAGASGACUAAAACCAAACAUGAUCUGGGUCAUC™3 Fig. 16, €03 
Lsh_pro-eRNA_A_SCK339 5 -GGAUUUAGACCACCCCAAUAUCGAAGGSGACUAAAACAGGGGCAGAGAUGAUGACCCU-3 Fig. 1. £03 
Lbu_pre-cfRNA inver_SCK321 5° -GGAUUUAGACCAGEGGAAGUAAAAACCCCACUAAAACAGGGCCAGAGAUGAUGACCCU-23, Fig. 28 
Lbu_pre-cfRNASslam_SCK301 5° -GGAUUUAGACCACCCCCAANAAUGAAGGGGGACUAAAACAGGGSCAGAGAUGAUGACCCU-3 Fig. 23 
Ubu pre-cfRNA_Toubble_SCK324 5° -GGAUUUAGACCACCCCAUGAAGGGGACUAARACAGGGECAGAGAUGAUGACCCU-3) Fig. 28 
Lbu_pre-cfRNA,ShubbieSCK225 5S’ -GGAUUUAGACCACCCCAUGAAGGGGACUAARACAGGGECAGAGAUGAUGACCCU-3) Fig. 23 
LLbu_pre-ciRNA_dstem_SCK342__ 5° -GGAUUJUAGACCACCCCAAAAAUGAAGGGCACUAAAACAGGGECAGAGAUGAUGACCCU-3 Fig. 28 
bu pre-cr S_mutt_AESAS7 5 -GGCGUUAGACCACCCCANANAUGAAGEGGACUAAAACAGGGGCAGAGAUGAUGACCCU-3 Fig, 2b, E04 
bu pre-cr §_mut2_AESA08 5 -GGAGCUAGACCACCCCAAANAUGAAGGGGACUAAAACAGGGECAGAGAUGAUGACCCU-3 Fig 2, ED4 
Lbu_pre-cr 6_utd_AES495 5" -GGAUCCAGACCACCCCAAAMAUGAAGGGGACUAAAACAGGGGCAGAGAUGAUGACCCU-3 Fig. 2, E04 
{bu pre-cr_S_mutt_AESA77 5" -GGAUUCGGACCACCCCAAAMAUGAAGEGGACUAAAACAGGGGCAGAGAUGAUGACCCU-3: Fig 2, E04 
bu. pre-cr 5_muts_AESA82 5" -GGAUUUACCCCACCCCAAAAAUGAAGEGGACUAAAACAGGGGCAGACAUGAUGACCCU-3 Fig, 2, E04 
bu pre-cr_5_muts_AESA7S 5" -GGAUUAAUCCACCCCAAAAAUGAAGGGGACLAAAACAGGGGCAGAGAUGAUGACCCU-2 Fig. 2b, ED 
bu pre-cr_§_mut?_AES#80 5" -GGAUUUAGAAAACCCCAAAAAUGAAGGGGACUAAAACAGGGGCAGAGAUGAUGACCCU-3 Fig. 2b, EDE 
ou pre_ot S_muté_AESA08 5 ' -GGAUUUAGACCECCCCAAARAUGAAGEGGACUAAAACAGGGCCAGAGAUGAUGACCCU-3 Fig. 2b, ED 
‘bu pre_stem_mutt_AESS02 5 -GGAUWUAGACCAGCCCAAAAAUGAAGGGCACUAAAACAGEGCCAGAGAUGAUGACCCU-3 Fig. 2b, ED¢ 
{.bu_pre_stom_mut2_AESSOA 5! -GGAUUUAGACCACCGCAAAMAUGAAGCGGACLAAAACAGGGCCAGAGAUGAUGACCCU-3: Fig. 2b, ED¢ 
bu pre_stem_mut8_AES5O0 5 -GGAUUAGACCACACCAAAAAUGAAGCUGACURAAACAGGCCAGAGAUGAUGACCCU-3" Fig. 2, EO 
bu pre_stem_mutt_AESA08 5 “GGAUUUAGACCACCACAAARAUGAAGUGGACUAAAACAGEGCCAGAGAUGAUGACCCU-3" Fig. 2, E04 
‘bu_pre_stom_muts_AESSO 5 -GGAUUUAGACCACUCCAAAAAUGAAGGAGAC UAAAACAGGGCAGAGAUGAUGACCCU-3" Fig. 2b, EOE 
Lu pre et 8 5 “GGAUUUAGACCACCCCAAARAUGAAGESCCALAAAACAGEGGCAGAGAUGAUGACCCU-3: Fig. 2, EO 
Lu pre 5 -GGAUUUAGACCACCCCAAAAAUGAAGGSGACGCAAACAGESCCAGAGAUGAUGACCCU-3 Fig, 2, EOE 
Lbu_pre_et_8_mut8_AESSO7 5 -GGAUUUAGACCACCCCAAAAAUGAAGGSGACUAGCACAGEGGCAGAGAUGAUGACCCU-3: Fig, 2, EDE 
bu_pre_er'_mut4_AESSO8 5 “GGAUUAGACCACCCCAAAAAUGAAGESGACUAAAGUAGGGCCAGAGAUGAUGACCCU-3 Fig, 2, ED 
calbu_A_GG_AESA32 5" -GBCCACCCCAAAAAUGAAGGGGACUAAKACASGGSCAGAGAUGAUGACCCU-3 eDe7 
crlby_B_AESSS1 5" -GGCCACCCCAAAAAUGAAGGGACUAAAACACAAACAUGAUCUGSGUCAUC-3 Fig. 3, £04 
‘A _targol_AES450 5" -GGCACACCCGCAGESGGGAGCCAAAAGSGUCAUCAUCUCUGCCCCCACASCAGAAGCCCC-3 Fig. 9, £047 
8 target AESAS2 5" -GGGAACCCCANGGECAACCSCGAGAAGAUGACCCAGAUCAUGUUUGAGACCUUCAACACCCC- Fig. 3, £046 
ceLby_Lambda2_AESAS3 5" -GGCCACCCCANAAAUGAAGGGGACUAANACAGUGAUAAGUGGAAUGCCAUG-3 Fig. 4b, E08 
cxlby_Lambda_ MOC#10 5" -GGCCACCCCAAAAAUGAAGGGGACUAAAACACUGGUGAACUUCCGAUAGUG-3 Fig. Ab 
cxLby_Lambdad_ MOCA11 5" -GGCCACCCCAAAAAUGAAGGGGACUAAAACACAGAUAUAGCCUGGUGGUUC-3 Fig 4b, E08 
\Lamibdaz_target_MOC28 5" -GGCUCAAUUUUGACAGCGGUCAUGGCAUUCCACUUAUCACUGGCAUCCUUCCACUC-3 Fig 4b, E08 
\Lamindad_target_MOC3S 5" -GGAAAUCAWUCAACACCCGCACUAUCGGAAGUUCACCAGCCAGCCGEAGCACGUU-3 Fig Ab 
\Lamindad_target_MOC37 5" -GGCAAUAAAAAUGCGCCGCCUGANCCACCAGGCUAUAUCUGCCACUCAUUGUUGUGA-3 Fig. 4b ED 8 
crlbu_betaActn_1_AESAS1 5" -GGCCACCCCAAAAAUGAAGGGGACUAAAACACAAACAUGAUCUGGGUCAUC-3 Fig Ae 
pre-rLbu.dimer_SCK324 5 GGAUUUAGACCACCCCAAAAAUGAAGEGGACUAAAACAGGGGCAGAGAUGAUGACCCUAUUUAGACCACCCCAAAAAUGA, Fig 4, E03 
‘RactEACURARACAGUGAUAACUCGAALEECAUG=3 
erLbu lambda SCKS15 5 “GGAUJUAGACCACCCCAAAAAUGAAGCSGACUAARACAGUGAUAAGUSGAAUGCCAUG-3 EDs 
Ubu pre_ct S\Amerl_AESE61 5° -GGAUMUAAUAAACCCCAAAAAUGAAGGGGACUAAAACAGGGCCAGAGAUGAUGACCCU-3. Eos 
Ubu prec Amer2_AESA79 5° -GGAUICGAUCCACECCAAAAAUGAAGOGGACUAAAACAGGGCCAGAGAUGAUGACECU-3 E03 
Ubu pre_et 8\Amerd_SCK343. 5° -GGAUWUAGGAAGCCCCAAANAUGAAGGGGACUAAMACAGGGGCAGAGAUGAUGACCCU-3 E03 
Lbu_pre_et_Amert_AESS9S _ 5° -GGAUJUAGACCAGECCAAAAAUGAAGSCCACUAAAACAGGGCCAGAGAUGAUGACCCU-3, EDs 
‘rLbu_GuideWakt_SCK302 5" -GGCCACCCCANAAAUGAAGGGGACUAANACAUUUUGGCUCCCCCCUGCAAAUGAG-3 EDs 
‘rLbu_GuideWake_SCK303 5" -GGCCACCCCAAAAAUGAAGGGGACUAAAACAACCCUWUUGGCUCCCCECUGCAAA-3 EDs 
crLbu_GuideWake_SCK204 5" -GGCCACCCCANAAAUGAAGGGGACUAANACAGAUGACCCUWUUGGCUCCCCCCUG-3 EDs 
cxLbu_GuideWakt_SCK205 5" -GGCCACCCCAAAAAUGAAGGGGACUAAAACAAGAUGAUGACCCUUUUGGCUCCCC-3 eos 
‘crLbu_GuidaWakS_SCK206 5" -GGCCACCCCANAAAUGAAGGGGACUAAAACAGCAGAGAUGAUGACCCUUUUGGCU-3 0s 
crLbu_GuideWaks_SCK207 5" -GOCCACCCEAAAAAUGAAGGGGACUAAAACAGGGGGCAGAGAUGAUGACCCUUWU-2 0s 
crLbu_GuideWak7_SCK208 5" -GGCCACCCCAAAAAUGAAGGGGACUAAAACACAGAGGGGGCAGAGAUGAUGACCC-3 EDs 
rlbu_GuideWak_ScK309 5" -GGCCACCCCAAAAAUGAAGCCCACUAAAACAUCAGCAGAGCCGGCACAGAUGAUG-3 EDs 
‘Atarget_U_Moca79 5 -GGCUCAUUUGCACEGCCGAGCCAAAAGGCUCAUCAUCUCUGCCCCCUCUGCUGAUGCCCC-3 EDs 
‘A2target_7ont_AESA47 5" -GGCCUGACUGCUCUCAUUUGCAGLUGEGACCCAAAAGEGUCAUCAUCUCUECCCCCUCUGCUGAUGCCEC-3 EDs 
‘AS target 8Ont_AESAC® 5 -GGACCUGUGAAUCCUGACUGCUCUCAULUGCAGUUGGEAGCCAAAAGEGUCAUCAUCUCUGCCCCCUCUGCUGAUECCCC-3' EDS 
‘Ad 5 ts shit_AESeag 5' -GGCACACCCECAGEGUUUAGCCAAAAGGCUCAUCALCUCUGCCCCCUCUeCUGAUGCCCC-3 EDs 
cextbu_A_18n_frune_SCK282 5" -GGCCACCCCAAAAAUGAAGGCGACUAAAACACACAGAUGAUGACCCU-3° EDs 
cxlbu_A 2ént_oxt_SCK283 5! -GGCCACCCCAAAAAUGAAGGCGACUAAAACAAGAGGGEGCAGAGAUGAUGACCCU-3 0S 
crLbu_A mature GA_SCK340 __5' -GACCACCCCAAAARUGAAGGGGACUAAAACASGCGCAGAGAUGAUGACCCU- 3 0s 
crLbu_A mature GGGA_SCKS41 5! -GEGACCACCCCAAAAAUGAAGGEGACUAAAACAGGGGCAGAGAUGAUGACCCU-3 D8 
crLbu_A_mature CCA AESS6? 5’ -CCACCCCAAAAAUGAAGGGACUAAAACAGGSGCAGAGAUGAUGACCCU-3 08 
‘TT Forwars (ONA) 5" STARTACGACTCACTATAGG-3 NA 
CORRECTIONS & AMENDMENTS


CORRIGENDUM 
doi:10.1038/nature907 


Corrigendum: Imbalance between 
pSmad3 and Notch induces CDK 
inhibitors in old muscle stem cells 
Morgan E. Carlson, Michael Hsu & Irina M. Conboy 


Nature 454, 528-532 (2008); doi:10.1038/nature07034


In Fig. 1a of this Letter, the immunofluorescence images for myostatin 
and follistatin are inaccurate owing to many versions of this figure in 
multiple revisions of our manuscript. Specifically, the immunofluores- 
cence panels representing anti-myostatin and anti-follistatin staining 
were duplicated. We would like to thank the anonymous reader who 
pointed out this error. We have repeated the age-specific immunoflu- 
orescence experiments for myostatin and follistatin, and the results 
confirm our original conclusion that no age-specific differences are 
detected (Fig. 1). Furthermore, we have performed quantification of
the pixel density, and although there are areas with higher and lower 
signals for these proteins in both young and old muscle sections, the 
P values suggest no age-specific differences. We thank M. Mehdipour 
for performing the experiments and analysing the data on age-specific 
muscle tissue levels of myostatin and follistatin. 


[Figure 1 panels: young and old muscle sections; follistatin, dystrophin and Hoechst staining; myostatin, dystrophin and Hoechst staining.]


Figure 1 | This is a repeat of Fig. 1a of the original Letter. Shown are 
representative images at ×20. Dystrophin immunofluorescence (green)
outlines the muscle fibres in 10-µm skeletal muscle cryosections that are
immunostained for follistatin and myostatin (red); Hoechst (blue) stains 
all nuclei. P values for mean pixel density of myostatin and follistatin 
(with rabbit IgG control signals subtracted) show no statistically 
significant differences between young and old muscle. 
THE DARK SIDE OF THE
HUMAN GENOME 


Scientists are uncovering the hidden switches in our genome that dial gene expression up 
and down, but much work lies ahead to peel back the many layers of regulation. 


BY KELLY RAE CHI 


Fifteen years ago, scientists celebrated
the first draft of the sequenced human
genome. At the time, they predicted that
humans had between 25,000 and 40,000 genes
that code for proteins. That estimate has
continued to fall. Humans actually seem to have
as few as 19,000 such genes¹, a mere 1-2% of
the genome. The key to our complexity lies in
how these genes are regulated by the
remaining 99% of our DNA, known as the genome's
'dark matter'.

From efforts such as the massive
Encyclopedia of DNA Elements (ENCODE) project
launched in 2003 by the US National Human
Genome Research Institute, it's clear that
copious regulatory elements are at play, tuning gene
expression in ways that scientists are only
starting to unravel. By uncovering regulatory
instructions in the genome beyond
protein-coding genes, scientists are hoping to yield
new ways to understand and treat disease. “It's
not overstating to say that ENCODE is as
significant for our understanding of the human
genome as the original DNA sequencing of the
human genome,” says cell biologist Bing Ren of
the University of California, San Diego,
Institute for Genomic Medicine in La Jolla, who is
a member of the ENCODE team.

Ren is also part of a subsequent consortium
called the Roadmap Epigenomics Project².
These two initiatives, both funded by the
US National Institutes of Health (NIH), aim
to map and predict the existence of elements
in the genome, including in the vast stretches
of non-coding portions, that drive when and
where genes are expressed. Scientists have
generated a list of such elements by using
biochemical assays to probe DNA sequences, RNA
transcripts, regulatory proteins bound to DNA
and RNA, and epigenetic signatures (the
chemical tags on DNA and the proteins packaging it)
that also affect gene expression.

The human genome is not packed with 'junk' as previously thought, but with regulatory regions that modulate gene activity.

So far, the data suggest that there are
hundreds of thousands of functional regions
in the human genome whose task is to control
gene expression: it turns out that much more
space in the human genome is devoted to
regulating genes than to the genes themselves.
Scientists are now trying to validate each
predicted element experimentally to ascertain
its function, a mammoth task, but one for
which they now have a powerful new tool.
Since the gene-editing technique
CRISPR-Cas9 entered the scientific arena,
the speed at which researchers can test
functional elements in the non-coding regions has
ramped up. But it is still a daunting endeavour:
more than 3 million regulatory DNA regions,
thought to contain some 15 million binding
sites for regulatory proteins called transcription
factors, control gene expression in the human
cell types studied thus far. About 150,000 may
be active in any given cell type.

These could be crucial to understanding
disease, because most single-nucleotide
changes associated with common diseases fall
in regions outside protein-coding genes, and
they often overlap with DNA sites highlighted
by ENCODE as having regulatory function.
Certain regulatory elements that normally
drive gene expression are thought to underpin
the mechanism of cancer, for example.
Disrupting a gene's regulatory elements, the data
suggest, could thus have as drastic an impact
on cell function as disrupting the gene itself.
Using CRISPR-Cas9, scientists now have an
opportunity to test that premise by introducing
targeted mutations into non-coding sequences
and observing the consequences.


DECODING A COMPLEX WORLD 
How much of DNA's dark matter has a function
in gene control is still up for debate. In 2012,
ENCODE scientists proposed on the basis of
biochemical-assay predictions that 80% of the
non-coding genome has a function³. But this
figure soon proved to be an overestimate as
researchers narrowed the definition of
'function' and devised experimental methods, such
as reporter assays, to test these functions. “The
number still isn't fully known,” in part because
the mapping isn't complete, says Michael
Snyder, a geneticist at Stanford University in
California and a member of ENCODE. “Most
people would say between 10% and 20% of the
[non-coding] genome is likely to have a
function where, if you disrupt it, you will affect
something.”

But regulatory elements have a bewildering
array of functions and forms, which makes
tackling them a formidable challenge. Even the
best-known types are hard to study: promoters,
which are spots in the genome that lie next to a
gene where transcription begins, and enhancers,
which are regions that, when bound by specific
transcription factors, alter the likelihood
of a gene being read. In
addition to the sheer number of these sites,
estimated at 15 million, enhancers may be
positioned thousands of base pairs away from
the gene that they control. This makes it tough
to predict where their target genes are located
and what they do.

Thus far, ENCODE and Roadmap have 
offered up important clues, but the real proof 
that these predicted regulatory elements actu- 
ally do something comes from a functional test. 
For genes, this mostly entails deleting them one 
at a time and observing the consequences in a
cell assay or animal model. This is less easy to
do for the non-coding genome because many
of the elements are redundant, and so deleting
just one might not alter gene expression or
produce an obvious change. “It's a huge challenge
that we have at the moment to really
distinguish between functional and non-functional
elements detected by ENCODE,” says geneticist
Ran Elkon of Tel Aviv University in Israel.

CRISPR-Cas9 is particularly accelerating 
scientists’ exploration of enhancers. The tech- 
nology enables scientists to alter large numbers 
of regulatory elements in a high-throughput 
way, using libraries of RNA guide fragments 
that target and disrupt different regions in the 
genome, to observe the outcome. Not only is the
method relatively fast, but researchers can also 
run the assays directly in human cells. 

Experiments of this type have already 
turned up some unexpected findings. While 
a postdoc working with cancer biologist 
Reuven Agami at the Netherlands Cancer 
Institute in Amsterdam, Elkon was involved 
in performing the first screen of regulatory
elements using the advanced editing system.
The CRISPR-Cas9 approach enabled them to
test individually those enhancers predicted by 
ENCODE to bind a transcription factor called 
p53. Interest in p53 is high because the protein 
is a known tumour suppressor that is mutated
in more than 50% of human tumours. The 
researchers were able to pinpoint two enhanc- 
ers from more than a thousand genomic sites 
that affect p53's tumour-suppressing func- 
tion, located near the p53-encoding gene. A 
predicted third enhancer has yet to be located 
because it is far from any gene, let alone one
related to p53. 

In a separate screen, the group targeted bind-
ing sites for oestrogen receptor-α — which is
implicated in breast cancer — and identi- 
fied three enhancer sequences that influence 
tumour growth; these elements could thus have 
a role in the development of resistance to
breast-cancer therapy.

At the Broad Institute of MIT and Harvard in
Cambridge, Massachusetts, bioengineer Feng
Zhang and his group also used CRISPR-Cas9 to
identify genes essential to the survival of can- 
cer cells. Using a melanoma model, they first 
screened around 18,000 genes in human cells 
to pinpoint ones that might underlie resistance 
to the melanoma drug vemurafenib. Then, in 
a follow-up study published last month, they
described a new screen that identified regula- 
tory regions on either side of several resistance 
genes. Their findings fit well with ENCODE 
data that predict regulatory regions at these 
locations — and they also reveal new functional
elements, says molecular biologist Neville 
Sanjana, who conducted the research as a for-
mer postdoc in Zhang's group and now works
at the New York Genome Center at New York 
University. 

Other CRISPR-Cas9 screening data have 
challenged ENCODE predictions. Richard 
Sherwood of Harvard Medical School in 
Boston, Massachusetts, and his collaborators 
created an approach called a multiplexed edit-
ing regulatory assay to screen for non-coding
regions that might influence gene expression in 
well-known mouse embryonic stem-cell lines. 
Using this technique, they obtained quantita- 
tive information about the extent to which 
these regulatory regions might contribute to 
gene expression. Some of their results are dis- 
cordant with regions flagged by ENCODE as 
potential enhancers because, when mutated, 
these areas did not affect gene expression. 

Moreover, the researchers also discovered 
mysterious sections that they dubbed 
‘unmarked regulatory elements’, or UREs, that
do not fit into any category of functional ele- 
ments. The team is currently exploring how 
widespread these UREs might be in the genome. 
This new type of assay, along with other gene- 
editing-based screens, will play an increasingly 
important part in the validation of ENCODE 
candidates, says Sherwood. 


TECHNIQUE TWEAKS 
Investigators working on the ENCODE and 
Roadmap projects have relied mostly on a bio- 
chemical technique called DNase-seq, which 
sequences and maps all exposed regions of the
genome. In these sections, the DNA is relaxed 
instead of tightly coiled around histones, and 
thus is more likely to facilitate transcription- 
factor binding that drives gene activation. 
By mapping these areas, investigators can 
pinpoint candidate enhancers, promoters, 
silencers, insulators and other regulatory ele- 
ments in the non-coding genome (see ‘Spot the 
regulators’). 

Another method, ATAC-seq, detects and
sequences sites in the chromatin that are acces- 
sible to the transposase enzymes used for the 
assay. Both DNase-seq and ATAC-seq produce 
a genome-wide view of regions of open chro- 
matin. According to the researchers, because 
such epigenomic profiles can map the extent to 
which genes are activated in certain cell types, 
they could be useful for clinical decision- 
making, and ATAC-seq is fast enough for this 
purpose. Many, however, consider a tech-
nique known as chromatin immunoprecipi-
tation (ChIP)-seq to be the most reliable for
this purpose because it is the only one that can 
identify all potential binding sites for a given 
transcription factor. 

Even so, biochemical assays can only hint at 
function. CRISPR-Cas9 cell screens, by con- 
trast, are more concrete because scientists can 
introduce a mutation or deletion at a particular
site in the genome and observe how it influences
gene expression. The disadvantage is that these
tests cover smaller portions of the genome. If the
full genome of 3 billion base pairs were repre- 
sented, for example, by three copies of Leo Tol- 
stoy’s classic novel War and Peace (1869), such 
screens would barely cover a single page, San- 
jana says — although he is optimistic that future 
gene-editing approaches will scale this up. 
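
To put rough numbers on Sanjana’s analogy, the short Python sketch below scales the genome onto the pages of the novel. The genome size and the three copies come from the text; the page count per copy is an assumption (editions vary widely), and the function names are illustrative only.

```python
# Back-of-envelope scaling for the War and Peace analogy.
GENOME_BP = 3_000_000_000   # from the article: about 3 billion base pairs
COPIES = 3                  # from the article: three copies of the novel
PAGES_PER_COPY = 1_225      # ASSUMPTION: page count depends on the edition


def bases_per_page(genome_bp=GENOME_BP, copies=COPIES,
                   pages_per_copy=PAGES_PER_COPY):
    """Base pairs that a single page stands for under the analogy."""
    return genome_bp / (copies * pages_per_copy)


def fraction_covered(pages_screened=1, copies=COPIES,
                     pages_per_copy=PAGES_PER_COPY):
    """Share of the genome represented by the pages a screen covers."""
    return pages_screened / (copies * pages_per_copy)


if __name__ == "__main__":
    print(f"One page stands for about {bases_per_page():,.0f} base pairs")
    print(f"That is {fraction_covered():.4%} of the genome")
```

With these assumed figures, one page stands for somewhat less than a million base pairs, a small fraction of a per cent of the genome. A different edition changes the exact figure but not the scale.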

“In the short term, I think CRISPR will serve 
mainly as a tool to validate functions pre-
dicted by those biochemical signatures,” says
Ren. Once enough of these kinds of screens
have been done, their data could be fed into a
machine-learning tool to improve its predic- 
tive power, Sherwood says. 

New computational tools are already 
providing scientists with smart ways to inter- 
pret biochemical mapping data. Algorithms 
can predict transcription-factor binding sites, 
which researchers can then probe for func-
tion. But even with algorithms, predicting
which enhancers are active in a given context
is harder in human genomes than in yeast or
worm genomes, says computational biologist 
Michael Beer of Johns Hopkins University in 
Baltimore, Maryland. 

Beer and his collaborators have developed a 
computational model to predict which tissue-
specific networks of gene-regulatory elements
are operating in a given cell type and to what 
extent they are perturbed in complex diseases. 
They trained their open-source algorithm,
called deltaSVM, on human lymphoblastoid cell
lines using gene data from ENCODE in 2012, 
followed by mouse ENCODE data in 2014. 

Scientists have initially focused on cancer 
to probe the links between functional ele- 
ments and disease because cancer is a simpler 
condition to study at the cell level than, say, a 
neuropsychiatric disorder — cancer cell lines
reveal simple-to-measure outcomes, such as 
cell multiplication, death or senescence. But 
the data that have streamed in from the Epi- 
genome Roadmap consortium are shifting 
scientists’ thinking about how cancers arise. 
A study published last year by geneticist John
Stamatoyannopoulos of the University of
Washington in Seattle and his collaborators
showed that mutations in a given cancer cell
type cluster in inaccessible chromatin regions
rather than in the exposed ones — possibly
because the open regions can be accessed by
DNA-repair enzymes.

The scientists also found that mutation
density in a tumour is defined by the epi- 
genomic profile specific to each type of cell. 
Consequently, the DNA sequence can be 
informative about tumour origin, which ush-
ers in the possibility of using epigenomic data to
trace cancer provenance in patients for whom
it remains unknown. It could also open up new
approaches to cancer treatment. “Cancer is
essentially a regulatory or epigenetic program
that is superimposed on a cell, and the result of
that program is the development of genetic and
genomic instability,” says Stamatoyannopoulos.
“As we’ve analysed lots of cancer genomes, all of
these patterns now are starting to come out that
were previously not imagined to exist.”

It’s possible that there are still elements in 
the genome that existing assays have missed. 
After all, regulatory signals still crop up unex-
pectedly, such as the UREs in Sherwood’s
screen. And a team of scientists led by Harvard
Medical School immunologist Daniel Tenen
discovered a potential new class of regula-
tors that seem to control whether a gene is
turned on or off by blocking the enzyme DNA
methyltransferase 1, which adds methyl
groups to silence genes. These elements are
dubbed ‘extracoding RNAs’ and, because they
can influence silencing in a gene-specific way,
have therapeutic potential. Earlier this year,
neuroscientist Jeremy Day of the University
of Alabama at Birmingham and his colleagues
showed in rat neurons that an extracoding
RNA influences the transcription of a gene
important for memory formation.

The ENCODE team will continue to map the
non-coding space in the genome and expects 
to cover most of the regulatory DNA by 2020, 
Stamatoyannopoulos says. A spatial under- 
standing of how DNA is packaged into a cell, 
and of the 3D folding that positions genes in 
close contact with their regulatory elements, 
will be key to predicting an element's target 
genes. The NIH Common Fund has begun the
4D Nucleome project, for instance, which aims
to predict the target genes for every regulatory 
element. That knowledge will help to fill in 
the picture of how a given regulatory element 
influences health and disease. 

Next-generation sequencing has been — and 
still is — the technological engine of ENCODE.
But looking ahead, researchers might be able to
roll out high-resolution live-cell imaging on
a large scale to watch the state of the genome
change in real time using specific markers. This
technology could be disruptive. “If we had a bet-
ter microscope, we wouldn’t be sequencing any-
more,” says Stamatoyannopoulos.


Kelly Rae Chi is a freelance science writer
based in Cary, North Carolina. 
INDUSTRY Moving from an academic lab to
a company role p28


STUDY ABROAD An Indian scientist in
Germany go.nature.com/2csidtn

VOLUNTEERING


Pro bono science 


Offering your skills as a volunteer can be rewarding but needs careful thought. 


BY AMBER DANCE 


When physician Alison Smith retired
in 2012, she wasn’t ready to hang up
her stethoscope. So the primary-care doctor
in Sussex, UK, decided to volunteer
for the charity Freedom from Torture in Lon-
don. She examines asylum seekers who need
medical evidence of their abuse to strengthen
their applications to stay in the country.

In part, the work helps to assuage what she 
describes as “survivor guilt” — the feeling that 
whereas she’s had a great life, others haven't been 
so lucky. It also makes her feel that she is counter-
balancing the acts of physicians who participate
in torture. Plus, Smith admits, she has a selfish
reason. “It’s an interesting challenge,” she says.
“I don’t want to let my professional skills go.”

Smith is one of many scientists, engineers
and health workers worldwide who apply their
professional abilities and skills — and their 
often-scant spare time — to volunteer missions. 
Not only does it give them satisfaction to deploy 
their skills for good, it can also offer career ben- 
efits such as opportunities for networking or, 
occasionally, for publishing. Researchers who 
feel a pull towards philanthropy can contribute
to projects that involve rights to health or to safe 
living environments, for example. And they 
need not wait until retirement or travel abroad 
to get involved: there are ways to make a posi- 
tive difference in the lives of many with a local 
commitment of a few hours. But it is important
for would-be volunteers to select their projects 
carefully — and to grasp what is required of 
them in scientific expertise, time commitment 
and costs they might have to shoulder. 

How to find an organization to volunteer
with? Some researchers land projects through
their universities, and many turn to their dis-
cipline’s professional associations. The On-call
Scientists programme, run by the American 
Association for the Advancement of Science 
(AAAS) in Washington DC, matches scien- 
tists with human-rights organizations. It has 
enrolled nearly 1,000 scientists from 58 coun- 
tries; tasks might include helping organizations 
to analyse and understand technical reports, 
designing surveys or developing useful tech- 
nologies. “I'm seeing more and more of these 
kinds of ‘without borders’ types of initiatives 
started by scientific organizations,” says Theresa
Harris, manager of the programme. 

One is the Thriving Earth Exchange (TEX), 
launched in 2013 by the American Geophysi- 
cal Union in Washington DC. It matches Earth 
scientists with communities that need
advice, such as cities that are planning for
the effects of climate change. Natasha Udu- 
gama, director of community partnerships 
for TEX, notes that scientists often begin their 
careers eager to help humankind. Then they 
get sidetracked by the pressure to publish, write 
grants, achieve tenure and reach other career 
milestones. Humanitarian projects, even small
ones, can help to fulfil that lost-along-the-way 
desire to make the world better. 

It’s the feeling that inspired Linus Bengtsson,
an epidemiologist at the Karolinska Institute in 
Stockholm, to found a non-profit humanitar- 
ian organization. “Everybody who deals with 
global health, they want to accomplish some- 
thing in the end, and many feel that research 
is not enough,” he says. In 2008 he co-founded 
Flowminder, which uses data about people's 
locations — from surveys, satellites and mobile 
phones — to map where people are in low- and
middle-income countries. These data can help 
charities and governments to direct aid or to 
track transmission of disease. About half of 
the people who work with the organization are 
volunteer scientists who may incorporate the 
studies into their own research; the rest are staff. 


IN DEMAND 
Some types of expertise are much in demand, 
including statisticians to crunch numbers, 
environmental scientists to evaluate reports 
on hazards, and physicians to provide medical 
assessments or care. But scientists and engineers 
can also contribute to projects outside their 
specific fields. “There are so many transferable
skills,” Harris says. “Anybody who knows how
to clean up a database is valuable.”

Perhaps one of the most attractive elements of 
volunteer work for researchers is that they can 
make a huge difference in others’ lives more rap-
idly than with most research projects. A mem-
ber of On-call Scientists, for example, helped
the small town of Temacapulin in Mexico as it 
faced the danger of flooding as a result of the
building of a dam. Harris put the town’s lawyers
in touch with a water-resources engineer who 
analysed the proposed dam, attended public 
hearings and helped to develop a plan to scale 
down the dam and safeguard the town. Statistics 
Without Borders, an outreach committee of the 
American Statistical Association in Alexandria, 
Virginia, provided volunteers to help Nepal to 
reorganize its census data in the wake of the 
April 2015 earthquake. The new spreadsheet 
allowed fieldworkers to know who was living 
in the communities they visited. And DataKind, 
headquartered in New York City, a global net- 
work of volunteers skilled in working with data, 
assisted US-based Crisis Text Line, a helpline 
for teens. They created an algorithm to identify 
the most urgent texts and prioritize responses. 

For researchers who have time to travel, or 
can combine philanthropy with work trips, 
off-site volunteering is an option. Mary Gray, 
a statistician and lawyer at the American Uni-
versity in Washington DC, spent two weeks in
Fiji in 2011 when local hospital officials wanted
to assess whether their medical care was effec- 
tive. Gray gave doctors and lawyers a crash 
course in statistics, which helped them to make 
those judgements. “In a week or in two you can
accomplish quite a lot,” she says. “You leave a
resource behind.”

Those who can't travel can often find oppor-
tunities close to home. Climate scientist Andrew
Rice at Portland State University in Oregon 
helped the nearby city of Eugene with its car- 
bon-emissions-cutting plan in a TEX project. 
He examined the approach and the maths, pro- 
vided by a city official and a consulting firm, 
on how much of the city’s carbon emissions 
would have to be cut. He attended a city-council
meeting, where he spoke about how the team
came up with its figures, and how the numbers 
related to global climate-change projections. 
The city approved an emissions-cutting target
based on the project in July. Rice liked knowing 
that his scientific skills had immediate impact. 
And Udu-gama of TEX says that cities appre- 
ciate input from an unpaid third-party expert. 

Volunteering can also expand one’s skill set.
Volunteers get a taste of project management 
and learn to work with clients from different 
cultural backgrounds. Teamwork is especially 
crucial in pro bono projects. “You must be will-
ing to listen,” says Udu-gama. “We stress that
above all else.”


CAREER BOOST 

Volunteer work can also help a career. Junior 
scientists and students may gain experience 
that they can add to their CVs, and research-
ers may be able to publish the work they did or
integrate it into their current projects. Ashley 
Collier, a graduate student in environmen- 
tal engineering at the University of Colorado 
Boulder, plans to include her volunteer work in 
her thesis on low-cost air-quality monitoring. 
Through TEX, she got involved with a com- 
munity group in Denver, Colorado, that was 
concerned about contamination from per-
chloroethylene, a chemical used in dry clean-
ing. Collier and collaborators tested homes for
perchloroethylene as well as radon gas. They 
found no worrying levels of perchloroethyl- 
ene, but discovered excessive radon in 12 out 
of 15 homes. Now, Collier and the community
group Taking Neighborhood Health to Heart 
are planning ways to educate residents about 
radon and how they can fix their homes to 
protect themselves. 


VOLUNTEER PREPARATION 
Questions to ask 


When seeking or preparing for volunteer
work, it is important to communicate with
the charity about both its expectations
and yours, and set the plan down clearly,
possibly even in a contract. “Treat your pro
bono engagement as you would a paid
engagement,” advises Miriam Young, head
of communications and culture for the data-
driven non-profit DataKind in New York City.
Here are some questions to ask the non-
profit, and yourself, before signing on the
dotted line.

● Is this a credible human-rights organization? What are its goals?
● How much time will I be spending on this project? How urgent is it, and how long will it take?
● What are my deliverables?
● What’s the budget? What expenses will I have to pay?
● Are there confidential data to protect, and who will have access?
● Do I need to do anything to make sure findings are admissible in court?
● Will there be any conflict of interest with my current or past employment?
● Do I need any insurance, such as liability or travel coverage?
● How will the non-profit use these results?
● Who owns the samples and data? Can I publish the results?
● Will I have to testify in court? Or talk to the media?
● How will the organization take care of my well-being?

For more guidelines, see Human Rights
Projects: Guidelines for Scientists and Human
Rights Organizations (American Association
for the Advancement of Science, 2012),
available at go.nature.com/2d7dtxg.
Scientists working with Flowminder can
adjust their research to better fit people's needs, 
Bengtsson says. Flowminder researchers collab- 
orated with a Nepali mobile-phone operator to 
map how people moved around after the 2015 
earthquake, producing data that were useful to 
relief organizations. In turn, the work led to a
publication for the researchers (R. Wilson et al.
PLoS Curr. Disasters http://doi.org/brbp; 2016). 

Stéphanie Horion, a postdoc at the Univer- 
sity of Copenhagen who studies global environ- 
mental change and Earth observation, found 
that a TEX project pointed her to a resource 
that she is now applying in her research. TEX 
asked her to help the Horn of Africa Regional 
Environmental Centre and Network, a group in 
Addis Ababa, Ethiopia, that promotes sustain-
able development and environmental issues. It
wanted to understand how a region surround-
ing Lake Ziway was using water. It already had
someone to build a model of the water usage
and requirements, but needed additional satel-
lite maps. Horion contacted her network of sci-
entists to find the right maps. In the process she
accessed new vegetation maps, which she is now 
using in her own analyses of ecosystem change. 

Networking while volunteering can also be a
significant career perk. Samarth Bhargav, a soft- 
ware engineer at Inferlytics in Bangalore, India, 
schmoozes with other data-heads at DataKind 
events. “You get to interact with these amazing 
people and pick their brains,” he says. And in his
DataKind project to help a non-governmental
organization process web-based complaints and
predict future problems (such as a spate of mes-
sages about overflowing drains that might her-
ald floods when monsoons hit) he’s learnt about
urban planning and government operations. 


CAREFUL PREPARATION 
There can be pitfalls, especially if the non-profits
or the scientists who work with them don’t com-
municate enough at the start (see ‘Questions to
ask’). Organizations may not understand the 
limitations of what scientists can provide if they 
are not spelled out. Harris recalls one non-profit 
that asked for an On-call Scientists volunteer to
design a survey, which the scientist completed. 
But when the non-profit asked how the survey 
would be distributed, the scientist was not able 
to do this, and both sides were left disappointed.

Harris also warns that non-profits often have 
to reorganize their priorities in a hurry, pushing
projects onto the back burner as a result of polit-
ical changes in the host nation or emergencies
such as a tsunami. Scientists should understand
this and should also keep in mind that their pro-
posed solution may not be used.

Volunteers may have to foot some expenses 
themselves. When Gray went to Fiji, she paid 
her own airfare. Smith estimates that she spends 
about £2,000 (US$2,600) per year on her med- 
ical-doctor registration, liability insurance and 
travel to and from London to work with Free- 
dom from Torture. And volunteers need to be 
realistic about how much time they can offer. 
Some say it would be easier to volunteer for
longer periods once one is retired or estab- 
lished in a career, but others say there’s no 
reason not to start early. The summer months 
or between semesters, when academic respon- 
sibilities diminish, are good times. “People 
should at least be thinking about these things 
as a graduate student — what are the societal
implications of your research?” says Rice. 
The experience required varies by organi-
zation. For example, On-call Scientists seeks
volunteers with at least a master’s degree plus 
two years of experience in their field. Statis- 
tics Without Borders lets anyone sign up, no 
matter their level of education or experience. 
Foreign-language skills help to boost a volun-
teer’s appeal, Harris notes. And some profes-
sors may be able to get their students
involved in projects, thus enabling
younger scientists to get experience in
volunteering.

Being accepted isn’t the only hurdle to suc-
cessful volunteering. Researchers should make
sure that the organization or project will have 
the desired impact on people in need, says 
Bengtsson. At AAAS, Harris vets non-prof- 
its by checking their past accomplishments 
and speaking with referees, such as peers at 
other charitable organizations. Scientists can 
check a prospective partner by reviewing the 
organization’s reports, and by checking web-
sites such as Charity Navigator or GuideStar,
which provide information on and evaluations 
of charities and non-profits. 

After that, the choice is straightforward.
“Do something that you think is interesting,”
Bengtsson advises. “Follow your heart.”


Amber Dance is a freelance writer in Los
Angeles, California.


CLARIFICATION 

The general advisory nature of the Careers 
article ‘How to survive as a whistle-blower’ 
(Nature 532, 405; 2016) meant that it 
purposely did not discuss the case used 
as an example in detail. For clarity, it
should be noted that as a result of the
university's initial internal inquiry into the
case, the co-authors were recommended
to submit a list of errors to the journal in
question. The editor-in-chief of the journal 
ultimately elected to retract the paper. In 
addition, a subsequent external inquiry 
concluded that although there had been 
misconduct by one researcher, there was 
no misconduct by the travelling supervisor 
referred to in the article, and there was
no intentional wrongdoing. It also found
that some of the allegations were not 
substantiated. 
TRADE TALK
Lab whiz 


As a postdoc and graduate student, Stefanie
Marquez-Vilendrer acquired a suite of lab skills.
Now she works at AxoGen, a medical-technology
firm in Alachua, Florida.


How did you approach your job interview? 
I looked at the job description and what skills
they were looking for, and came up with a 
response to how I would fit. Even if I hadn't 
written regulatory documents, I could say I 
had written grants. It’s better to say that than 
to say that you don't have any clue. 


Why did your company choose you? 
They wanted someone who could provide 
support for all sorts of experiments, and to 
help with writing protocols and with univer-
sity collaborations. Even though I didn't have
all the techniques, I had shown I could learn
many new methods in my postdoc. And I 
did have management experience coordinat- 
ing 10-15 people, mainly undergraduates. 
I work closely with students and interns 
here at AxoGen, helping them to plan what 
experiments they need to run and what rea- 
gents they need to buy. 


Do you wish you'd done anything differently 
during your training? 

For graduate school I found a lab that was 
interesting to me, rather than a lab that pub- 
lished a lot or had a lot of funding; those were 
questions I probably should have asked about 
when I was choosing. An important question 
to ask about potential mentors is how often 
they participate in conferences. As a postdoc,
I didn’t attend a single conference outside the
University of Florida. This hurt my chances 
of getting a job quickly, because I didn't make 
any industry contacts. 


How can trainees best prepare for their job 
search? 

I don’t regret doing a postdoc, but I wish that
I had had different opportunities presented to
me. We have quite a few interns here; they are
getting experience that will make them more 
competitive if they decide to go into industry. 
I didn't have that available. It was that lack of 
industry experience that was holding me back 
— it was hard for me to get my foot in the door.


INTERVIEW BY MONYA BAKER 
This interview has been edited for length and 
clarity. See go.nature.com/2dbhSus for more. 
BY AIDAN DOYLE


Hi everyone! I’m a senior consultant
with Galactic Empire Management
Consulting. I’ve been brought on
board to analyse the previous project’s failure
and to help bring the new project back on 
schedule. We have a tight deadline, but I'm 
sure if we all work together we can achieve 
those key milestones! I look forward to 
working with all of you! 

###
I’d appreciate it if everyone took a few
minutes to answer the attached survey ques-
tions. We’re undertaking an Organizational
Culture Inventory (OCI) of the Empire’s
workforce. It’s widely recognized that work-
place culture underpins everything we do,
including how we approach our work and
interact with each other. Improving work-
place culture can help us innovate and suc-
ceed as an organization.
###

After conducting a preliminary review, it 
looks as though the new project has failed 
to learn from previous mistakes. In our 
experience, when it comes to integrat- 
ing new planets into an empire, building 
a planet-destroying space station is sim- 
ply not cost efficient. Other clients have 
achieved greater results through building 
hospitals, schools and cultural centres. Our 
initial report also highlights key failures in 
process management and threat assess- 
ment. I trust you'll all make time to thor- 
oughly review the document. Also, please 
note, as of Wednesday, the use of emoji will 
no longer be permitted in work-related 
messages.

###
I've uncovered disturbing reports about 
staff being discriminated against due to 
their lack of faith. I’d like to remind every-
one that religious discrimination in the
workplace will absolutely not be tolerated. 
I’ll be scheduling a series of mandatory sen-
sitivity training workshops over the next
couple of weeks.
###

A happy workplace is a productive work- 
place! When one of your co-workers helps 
you, remember to let them know you 
appreciate their actions. You now have the 
opportunity to thank your colleagues by 
using the attached Thank You Template! 
Prepare for excellence!


Not only will recipients love getting your 
thanks, but you also have the chance to win 
one of five cantina voucher rewards in a 
monthly draw! 

###
After reviewing a number of military 
supplier contracts and conducting an analy- 
sis of key battle performance reports, we've 
reached the conclusion that supplier claims 
about the accuracy of their weapons reduc- 
ing the need for troop training have been 
exaggerated. As a first step to ensuring our
armed forces maintain their famed com- 
bat excellence, we recommend a complete 
systems upgrade of the imperial personnel 
training database. 

###
It is with great disappointment that I inform
you that members of the engineering divi- 
sion have been disciplined for posting 
criticisms of the project's revised work 
schedule on social media. The new sched- 
ule was developed with the input of leading 
business analysts and experienced pro- 
ject managers and is entirely achievable. 
I shouldn’t have to remind staff that the new
schedule has the Emperor's approval. I’ll also
take this opportunity to reiterate that staff 
need to seek management approval before 
posting on social media. 

###
I’d like to remind middle management that
summary termination of staff must receive 
approval from the Executive Leadership 
Team. We don't want to risk violating work-
place agreements that could render the
Empire liable to compensation claims and 
industrial action. 

###
Please find attached an updated version of 
the imperial guide to using the tractor beam. 
This guide has been developed in consulta-
tion with the beam's manufacturer, along
with workplace safety experts. Staff are 
reminded that using the tractor beam for 
personal reasons is strictly prohibited. 

###
An analysis of the imperial performance-
based pay system has discovered that the practice
of rewarding staff with bonuses for subdu- 
ing rebel systems has led to staff abusing this 
system. Some staff have been found guilty 
of encouraging newly integrated planets 
to embark on a rebellion, then reconquer- 
ing them to claim the bonus multiple times. 
Pending further review, this pay bonus has
been discontinued. 

* * *
Our new values are here! I'm pleased to share
the Empire's new values: Service Excellence,
Ownership, Teamwork and Wellbeing.
These have been developed by the Executive
Leadership Team based on feedback
provided by staff. These values will help 
clarify our mission to bring peace and pros- 
perity to all of the Galaxy. I encourage you 
all to reflect on our new values and consider 
how you can bring them to life through 
your behaviours. We will be asking for your 
thoughts and input on behaviours over the 
coming weeks. 

* * *
Engineering staff are reminded to regularly 
change their password. It is vitally important
that classified documents do not fall into the 
wrong hands. 

* * *
The Empire has raised the Imperial Terrorism
Public Alert Level from medium to high.
In response to this heightened security alert 
level, the Executive Leadership Team has 
mandated the use of colour-coded security 
badges for all staff, contractors and visitors 
entering imperial work bases. The badges are 
coded as follows: blue for members of the 
imperial armed forces, red for contractors 
and bounty hunters, and yellow for visitors. 
Thank you for your cooperation.

* * *
Your feedback is requested on the three 
finalists in the Design a New Imperial Logo 
contest. The winner will have the honour of 
seeing their design emblazoned on imperial 
spacecraft and military uniforms. 

* * *
Please find attached the final report on the 
failures of the previous project. In my expe- 
rience, these reports are regrettably often 
ignored, but I hope you will take the time to 
thoroughly review it and implement its rec- 
ommendations. The fate of the Galaxy is in 
your hands! Thank you for all your support 
and have a great day!


Aidan Doyle is an Australian writer and 
computer programmer. He has visited
more than 90 countries and his experiences
include teaching English in Japan, 
interviewing ninjas in Bolivia and going 
ten-pin bowling in North Korea. 